METHOD OF DIAGNOSING, MONITORING, STAGING, IMAGING AND
TREATING PROSTATE CANCER

FIELD OF THE INVENTION 

This invention relates, in part, to newly developed assays for
detecting, diagnosing, monitoring, staging, prognosticating,
imaging and treating cancers, particularly prostate cancer.

BACKGROUND OF THE INVENTION 

Cancer of the prostate is the most prevalent malignancy in adult
males, excluding skin cancer, and is an increasingly prevalent
health problem in the United States. In 1996, it was estimated
that 41,400 deaths would result from this disease in the United
States alone, indicating that prostate cancer is second only to
lung cancer as the most common cause of cancer death in the same
population. If diagnosed and treated early, when the cancer is
still confined to the prostate, the chance of cure is
significantly higher.

WO 00/23111 PCT/US99/24331

Treatment decisions for an individual are linked to the stage of
prostate cancer present in that individual. A common
classification of the spread of prostate cancer was developed by
the American Urological Association (AUA). The AUA system divides
prostate tumors into four stages, A to D. Stage A, microscopic
cancer within the prostate, is further subdivided into sub-stages
A1 and A2. Sub-stage A1 is a well-differentiated cancer confined
to one site within the prostate. Treatment is generally
observation, radical prostatectomy, or radiation. Sub-stage A2 is
a moderately to poorly differentiated cancer at multiple sites
within the prostate. Treatment is radical prostatectomy or
radiation. Stage B, a palpable lump within the prostate, is also
further subdivided into sub-stages B1 and B2. In sub-stage B1, the
cancer forms a small nodule in one lobe of the prostate. In
sub-stage B2, the cancer forms large or multiple nodules, or
occurs in both lobes of the prostate. Treatment for sub-stages B1
and B2 is either radical prostatectomy or radiation. Stage C is a
large cancer mass involving most or all of the prostate and is
also further subdivided into two sub-stages. In sub-stage C1, the
cancer forms a continuous mass that may have extended beyond the
prostate. In sub-stage C2, the cancer forms a continuous mass that
invades the surrounding tissue. Treatment for both these
sub-stages is radiation with or without drugs to address the
cancer. The fourth stage, Stage D, is metastatic cancer and is
also subdivided into two sub-stages. In sub-stage D1, the cancer
appears in the lymph nodes of the pelvis. In sub-stage D2, the
cancer involves tissues beyond lymph nodes. Treatment for both of
these sub-stages is systemic drugs to address the cancer as well
as pain.
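
The AUA sub-stages and treatments described above can be collected
into a simple lookup table. The following Python sketch is
illustrative only: the names AUA_STAGES, describe_stage and
is_metastatic are hypothetical and are not part of the disclosure,
and the entries merely restate the description above.

```python
# Lookup table restating the AUA sub-stages described in the text.
# All identifiers here are hypothetical names chosen for illustration.
AUA_STAGES = {
    "A1": ("well-differentiated cancer confined to one site within the prostate",
           "observation, radical prostatectomy, or radiation"),
    "A2": ("moderately to poorly differentiated cancer at multiple sites within the prostate",
           "radical prostatectomy or radiation"),
    "B1": ("small nodule in one lobe of the prostate",
           "radical prostatectomy or radiation"),
    "B2": ("large or multiple nodules, or cancer in both lobes of the prostate",
           "radical prostatectomy or radiation"),
    "C1": ("continuous mass that may have extended beyond the prostate",
           "radiation with or without drugs"),
    "C2": ("continuous mass that invades the surrounding tissue",
           "radiation with or without drugs"),
    "D1": ("cancer in the lymph nodes of the pelvis",
           "systemic drugs for the cancer and for pain"),
    "D2": ("cancer involving tissues beyond lymph nodes",
           "systemic drugs for the cancer and for pain"),
}


def describe_stage(sub_stage: str) -> str:
    """Return a one-line summary of an AUA sub-stage such as 'B2'."""
    key = sub_stage.strip().upper()
    if key not in AUA_STAGES:
        raise ValueError(f"unknown AUA sub-stage: {sub_stage!r}")
    description, treatment = AUA_STAGES[key]
    return f"Sub-stage {key}: {description}. Treatment: {treatment}."


def is_metastatic(sub_stage: str) -> bool:
    """Stage D (D1, D2) is the metastatic stage in the AUA system."""
    return sub_stage.strip().upper().startswith("D")
```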

However, current prostate cancer staging methods are limited. As
many as 50% of prostate cancers initially staged as A2, B, or C
are actually stage D, metastatic. Discovery of metastasis is
significant because patients with metastatic cancers have a poorer
prognosis and require significantly different therapy than those
with localized cancers. The five-year survival rates for patients
with localized and metastatic prostate cancers are 93% and 29%,
respectively.

Accordingly, there is a great need for more sensitive and accurate
methods for the staging of a cancer in a human to determine
whether or not such cancer has metastasized and for monitoring the
progress of a cancer in a human which has not metastasized for the
onset of metastasis.

It has now been found that a number of proteins in the public
domain are useful as diagnostic markers for prostate cancer. These
diagnostic markers are referred to herein as cancer specific genes
or CSGs and include, but are not limited to: Pro109 which is a
human zinc-α2-glycoprotein (Freje et al. Genomics 1993
18(3):575-587); Pro112 which is a human cysteine-rich protein with
a zinc-finger motif (Liebhaber et al. Nucleic Acid Research 1990
18(13):3871-3879; WO9514772 and WO9845436); Pro111 which is a
prostate-specific transglutaminase (Dubbink et al. Genomics 1998
51(3):434-444); Pro115 which is a novel serine protease with
transmembrane, LDLR, and SRCR domains and maps to 21q22.3
(Paoloni-Giacobino et al. Genomics 1997 44(3):309-320; WO9837418
and WO987093); Pro110 which is a human breast carcinoma fatty acid
synthase (U.S. Patent 5,665,874 and WO9403599); Pro113 which is a
homeobox gene, HOXB13 (Steinicki et al. J. Invest. Dermatol. 1998
111:57-63); Pro114 which is a human tetraspan NET-1 (WO9839446);
and Pro118 which is a human JM27 protein (WO9845435). ESTs for
these CSGs are set forth in SEQ ID NO:1, 3, 5, 7, 9, 11, 13 and 15
while the full length contigs for these CSGs are set forth in SEQ
ID NO:2, 4, 6, 8, 10, 12, 14 and 16, respectively. Additional CSGs
for use in the present invention are depicted herein in SEQ ID
NO:17, 18, 19 and 20.

In the present invention, methods are provided for detecting,
diagnosing, monitoring, staging, prognosticating, imaging and
treating prostate cancer via the cancer specific genes referred to
herein as CSGs. For purposes of the present invention, CSG refers,
among other things, to native protein expressed by the gene
comprising a polynucleotide sequence of SEQ ID NO:1, 2, 3, 4, 5,
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20. By "CSG"
it is also meant herein polynucleotides which, due to degeneracy
in genetic coding, comprise variations in nucleotide sequence as
compared to SEQ ID NO:1-20, but which still encode the same
protein. In the alternative, CSG as used herein means the native
mRNA encoded by the gene comprising the polynucleotide sequence of
SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16,
17, 18, 19 or 20, levels of the gene comprising the polynucleotide
sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13,
14, 15, 16, 17, 18, 19 or 20, or levels of a polynucleotide which
is capable of hybridizing under stringent conditions to the
antisense sequence of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11,
12, 13, 14, 15, 16, 17, 18, 19 or 20.
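
The notion that polynucleotides differing in sequence can, through
degeneracy of the genetic code, still encode the same protein can
be illustrated with a short Python sketch. The two DNA sequences
below are toy examples invented for illustration and are not taken
from SEQ ID NO:1-20; the function names are likewise hypothetical.
The codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence in frame 0, stopping at a stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


def encode_same_protein(seq_a: str, seq_b: str) -> bool:
    """True if two coding sequences translate to the same peptide."""
    return translate(seq_a) == translate(seq_b)


# Hypothetical toy sequences that differ at three codons.
seq_a = "ATGGCTCTGAAA"
seq_b = "ATGGCCTTAAAG"
print(translate(seq_a), translate(seq_b), encode_same_protein(seq_a, seq_b))
```

Both toy sequences translate to the same four-residue peptide even
though their nucleotide sequences differ, which is the sense in
which degenerate variants are covered by the definition above.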

Other objects, features, advantages and aspects of the 
present invention will become apparent to those of skill in 
the art from the following description. It should be 
understood, however, that the following description and the 
specific examples, while indicating preferred embodiments of 
the invention, are given by way of illustration only. Various 
changes and modifications within the spirit and scope of the 
disclosed invention will become readily apparent to those 
skilled in the art from reading the following description and 
from reading the other parts of the present disclosure. 

SUMMARY OF THE INVENTION 

Toward these ends, and others, it is an object of the 
present invention to provide a method for diagnosing the 
presence of prostate cancer by analyzing for changes in levels 
of CSG in cells, tissues or bodily fluids compared with levels 
of CSG in preferably the same cells, tissues, or bodily fluid 
type of a normal human control, wherein a change in levels of 
CSG in the patient versus the normal human control is 
associated with prostate cancer. 

Further provided is a method of diagnosing metastatic 
prostate cancer in a patient having prostate cancer which is 
not known to have metastasized by identifying a human patient 
suspected of having prostate cancer that has metastasized; 
analyzing a sample of cells, tissues, or bodily fluid from 
such patient for CSG; comparing the CSG levels in such cells, 
tissues, or bodily fluid with levels of CSG in preferably the 
same cells, tissues, or bodily fluid type of a normal human 
control, wherein an increase in CSG levels in the patient 
versus the normal human control is associated with prostate 
cancer which has metastasized. 
Also provided by the invention is a method of staging prostate
cancer in a human which has such cancer by identifying a human
patient having such cancer; analyzing a sample of cells, tissues,
or bodily fluid from such patient for CSG; comparing CSG levels in
such cells, tissues, or bodily fluid with levels of CSG in
preferably the same cells, tissues, or bodily fluid type of a
normal human control sample, wherein an increase in CSG levels in
the patient versus the normal human control is associated with a
cancer which is progressing and a decrease in the levels of CSG is
associated with a cancer which is regressing or in remission.

Further provided is a method of monitoring prostate cancer in a
human having such cancer for the onset of metastasis. The method
comprises identifying a human patient having such cancer that is
not known to have metastasized; periodically analyzing a sample of
cells, tissues, or bodily fluid from such patient for CSG;
comparing the CSG levels in such cells, tissues, or bodily fluid
with levels of CSG in preferably the same cells, tissues, or
bodily fluid type of a normal human control sample, wherein an
increase in CSG levels in the patient versus the normal human
control is associated with a cancer which has metastasized.

Further provided is a method of monitoring the change in stage of
prostate cancer in a human having such cancer by looking at levels
of CSG in a human having such cancer. The method comprises
identifying a human patient having such cancer; periodically
analyzing a sample of cells, tissues, or bodily fluid from such
patient for CSG; comparing the CSG levels in such cells, tissues,
or bodily fluid with levels of CSG in preferably the same cells,
tissues, or bodily fluid type of a normal human control sample,
wherein an increase in CSG levels in the patient versus the normal
human control is associated with a cancer which is progressing and
a decrease in the levels of CSG is associated with a cancer which
is regressing or in remission.

Further provided are methods of designing new therapeutic agents
targeted to a CSG for use in imaging and treating prostate cancer.
For example, in one embodiment, therapeutic agents such as
antibodies targeted against CSG or fragments of such antibodies
can be used to detect or image localization of CSG in a patient
for the purpose of detecting or diagnosing a disease or condition.
Such antibodies can be polyclonal, monoclonal, or omniclonal or
prepared by molecular biology techniques. The term "antibody", as
used herein and throughout the instant specification, is also
meant to include aptamers and single-stranded oligonucleotides
such as those derived from an in vitro evolution protocol referred
to as SELEX and well known to those skilled in the art. Antibodies
can be labeled with a variety of detectable labels including, but
not limited to, radioisotopes and paramagnetic metals. Therapeutic
agents such as antibodies or fragments thereof can also be used in
the treatment of diseases characterized by expression of CSG. In
these applications, the antibody can be used without or with
derivatization to a cytotoxic agent such as a radioisotope,
enzyme, toxin, drug or a prodrug.

Other objects, features, advantages and aspects of the present
invention will become apparent to those of skill in the art from
the following description. It should be understood, however, that
the following description and the specific examples, while
indicating preferred embodiments of the invention, are given by
way of illustration only. Various changes and modifications within
the spirit and scope of the disclosed invention will become
readily apparent to those skilled in the art from reading the
following description and from reading the other parts of the
present disclosure.

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to diagnostic assays and methods,
both quantitative and qualitative, for detecting, diagnosing,
monitoring, staging and prognosticating cancers by comparing
levels of CSG in a human patient with those of CSG in a normal
human control. For purposes of the present invention, what is
meant by CSG levels is, among other things, levels of native
protein expressed by the gene comprising a polynucleotide sequence
of SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15,
16, 17, 18, 19 or 20. By "CSG" it is also meant herein
polynucleotides which, due to degeneracy in genetic coding,
comprise variations in nucleotide sequence as compared to SEQ ID
NO:1-20, but which still encode the same protein. The native
protein being detected may be whole, a breakdown product, a
complex of molecules or chemically modified. In the alternative,
CSG as used herein means the native mRNA encoded by the gene
comprising the polynucleotide sequence of SEQ ID NO:1, 2, 3, 4, 5,
6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20, levels
of the gene comprising the polynucleotide sequence of SEQ ID NO:1,
2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or
20, or levels of a polynucleotide which is capable of hybridizing
under stringent conditions to the antisense sequence of SEQ ID
NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18,
19 or 20. Such levels are preferably determined in at least one of
cells, tissues and/or bodily fluids, including determination of
normal and abnormal levels. Thus, for instance, a diagnostic assay
in accordance with the invention for diagnosing overexpression of
CSG protein compared to normal control bodily fluids, cells, or
tissue samples may be used to diagnose the presence of prostate
cancer.

All the methods of the present invention may optionally include
determining the levels of other cancer markers as well as CSG.
Other cancer markers, in addition to CSG, useful in the present
invention will depend on the cancer being tested and are known to
those of skill in the art.

Diagnostic Assays

The present invention provides methods for diagnosing the presence
of prostate cancer by analyzing for changes in levels of CSG in
cells, tissues or bodily fluids compared with levels of CSG in
cells, tissues or bodily fluids of preferably the same type from a
normal human control, wherein an increase in levels of CSG in the
patient versus the normal human control is associated with the
presence of prostate cancer.

Without limiting the instant invention, typically, for a
quantitative diagnostic assay a positive result indicating the
patient being tested has cancer is one in which cells, tissues or
bodily fluid levels of the cancer marker, such as CSG, are at
least two times higher, and most preferably are at least five
times higher, than in preferably the same cells, tissues or bodily
fluid of a normal human control.
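
The two-fold and five-fold thresholds stated above amount to a
ratio test between the patient level and the control level. The
following Python sketch shows one way such a comparison might be
expressed; the function names, the result labels and the example
values are hypothetical, and only the two thresholds come from the
text.

```python
def fold_change(patient_level: float, control_level: float) -> float:
    """Ratio of the patient CSG level to the normal control CSG level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return patient_level / control_level


def classify_result(patient_level: float, control_level: float,
                    positive_fold: float = 2.0,
                    preferred_fold: float = 5.0) -> str:
    """Label a quantitative assay result using the stated fold thresholds.

    At least two-fold over control counts as positive; at least
    five-fold meets the most preferred threshold.
    """
    ratio = fold_change(patient_level, control_level)
    if ratio >= preferred_fold:
        return "positive (most preferred threshold)"
    if ratio >= positive_fold:
        return "positive"
    return "negative"


# Hypothetical levels in arbitrary units, same sample type for both.
print(classify_result(10.0, 4.0))  # ratio 2.5
print(classify_result(25.0, 4.0))  # ratio 6.25
print(classify_result(6.0, 4.0))   # ratio 1.5
```

Both measurements are assumed to come from the same sample type
and assay, as the text prefers.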

The present invention also provides a method of diagnosing
metastatic prostate cancer in a patient having prostate cancer
which is not known to have metastasized. In the method of the
present invention, a human cancer patient suspected of having
prostate cancer which may have metastasized (but which was not
previously known to have metastasized) is identified. This is
accomplished by a variety of means known to those of skill in the
art.

In the present invention, determining CSG levels in cells, tissues
or bodily fluid is particularly useful for discriminating between
prostate cancer which has not metastasized and prostate cancer
which has metastasized. Existing techniques have difficulty
discriminating between prostate cancer which has metastasized and
prostate cancer which has not metastasized, and proper treatment
selection is often dependent upon such knowledge.

In the present invention, the cancer marker measured in such
cells, tissues or bodily fluid is CSG, and its levels are compared
with levels of CSG in preferably the same cells, tissue or bodily
fluid type of a normal human control. That is, if the cancer
marker being observed is just CSG in serum, this level is
preferably compared with the level of CSG in serum of a normal
human control. An increase in the CSG in the patient versus the
normal human control is associated with prostate cancer which has
metastasized.

Without limiting the instant invention, typically, for a
quantitative diagnostic assay a positive result indicating the
cancer in the patient being tested or monitored has metastasized
is one in which cells, tissues or bodily fluid levels of the
cancer marker, such as CSG, are at least two times higher, and
most preferably are at least five times higher, than in preferably
the same cells, tissues or bodily fluid of a normal human control.

Normal human control as used herein includes a human patient
without cancer and/or non-cancerous samples from the patient; in
the methods for diagnosing or monitoring for metastasis, normal
human control may preferably also include samples from a human
patient that is determined by reliable methods to have prostate
cancer which has not metastasized.

Staging

The invention also provides a method of staging prostate cancer in
a human patient. The method comprises identifying a human patient
having such cancer and analyzing cells, tissues or bodily fluid
from such human patient for CSG. The CSG levels determined in the
patient are then compared with levels of CSG in preferably the
same cells, tissues or bodily fluid type of a normal human
control, wherein an increase in CSG levels in the human patient
versus the normal human control is associated with a cancer which
is progressing and a decrease in the levels of CSG (but still
increased over true normal levels) is associated with a cancer
which is regressing or in remission.

Monitoring

Further provided is a method of monitoring prostate cancer in a
human patient having such cancer for the onset of metastasis. The
method comprises identifying a human patient having such cancer
that is not known to have metastasized; periodically analyzing
cells, tissues or bodily fluid from such human patient for CSG;
and comparing the CSG levels determined in the human patient with
levels of CSG in preferably the same cells, tissues or bodily
fluid type of a normal human control, wherein an increase in CSG
levels in the human patient versus the normal human control is
associated with a cancer which has metastasized. In this method,
normal human control samples may also include prior patient
samples.

Further provided by this invention is a method of monitoring the
change in stage of prostate cancer in a human patient having such
cancer. The method comprises identifying a human patient having
such cancer; periodically analyzing cells, tissues or bodily fluid
from such human patient for CSG; and comparing the CSG levels
determined in the human patient with levels of CSG in preferably
the same cells, tissues or bodily fluid type of a normal human
control, wherein an increase in CSG levels in the human patient
versus the normal human control is associated with a cancer which
is progressing in stage and a decrease in the levels of CSG is
associated with a cancer which is regressing in stage or in
remission. In this method, normal human control samples may also
include prior patient samples.

Monitoring a patient for onset of metastasis is periodic and
preferably done on a quarterly basis. However, this may be more or
less frequent depending on the cancer, the particular patient, and
the stage of the cancer.
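
Because prior patient samples may serve as controls in the
monitoring methods, periodic measurements can be compared one
against the next. The Python sketch below labels the direction of
change between consecutive samples; the function name, the labels
and the quarterly values are hypothetical, and the sketch makes no
attempt to model assay variability.

```python
from typing import List, Sequence


def trend(levels: Sequence[float]) -> List[str]:
    """Label the change between each pair of consecutive CSG measurements.

    Per the text, an increase is associated with progression or
    metastasis, and a decrease with regression or remission.
    """
    labels = []
    for previous, current in zip(levels, levels[1:]):
        if current > previous:
            labels.append("increase")
        elif current < previous:
            labels.append("decrease")
        else:
            labels.append("unchanged")
    return labels


# Hypothetical quarterly serum CSG levels in arbitrary units.
quarterly_levels = [1.0, 1.2, 1.2, 0.9]
print(trend(quarterly_levels))
```
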

Assay Techniques

Assay techniques that can be used to determine levels of gene
expression (including protein levels), such as CSG of the present
invention, in a sample derived from a patient are well known to
those of skill in the art. Such assay methods include, without
limitation, radioimmunoassays, reverse transcriptase PCR (RT-PCR)
assays, immunohistochemistry assays, in situ hybridization assays,
competitive-binding assays, Western Blot analyses, ELISA assays
and proteomic approaches: two-dimensional gel electrophoresis (2D
electrophoresis) and non-gel based approaches such as mass
spectrometry or protein interaction profiling. Among these, ELISAs
are frequently preferred to diagnose a gene's expressed protein in
biological fluids.

An ELISA assay initially comprises preparing an antibody, if not
readily available from a commercial source, specific to CSG,
preferably a monoclonal antibody. In addition a reporter antibody
generally is prepared which binds specifically to CSG. The
reporter antibody is attached to a detectable reagent such as a
radioactive, fluorescent or enzymatic reagent, for example
horseradish peroxidase enzyme or alkaline phosphatase.

To carry out the ELISA, antibody specific to CSG is incubated on a
solid support, e.g. a polystyrene dish, that binds the antibody.
Any free protein binding sites on the dish are then covered by
incubating with a non-specific protein such as bovine serum
albumin. Next, the sample to be analyzed is incubated in the dish,
during which time CSG binds to the specific antibody attached to
the polystyrene dish. Unbound sample is washed out with buffer. A
reporter antibody specifically directed to CSG and linked to a
detectable reagent such as horseradish peroxidase is placed in the
dish, resulting in binding of the reporter antibody to any CSG
captured by the immobilized antibody. Unattached reporter antibody
is then washed out. Reagents for peroxidase activity, including a
colorimetric substrate, are then added to the dish. Immobilized
peroxidase, linked to CSG antibodies, produces a colored reaction
product. The amount of color developed in a given time period is
proportional to the amount of CSG protein present in the sample.
Quantitative results typically are obtained by reference to a
standard curve.
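
Reading a quantitative result off a standard curve can be sketched
as interpolation between measured standards. The Python example
below uses simple piecewise-linear interpolation, which is an
assumption made for illustration (laboratories often fit other
curve models); the function name and the standard values are
hypothetical.

```python
from bisect import bisect_left
from typing import List, Tuple


def concentration_from_signal(signal: float,
                              standards: List[Tuple[float, float]]) -> float:
    """Estimate concentration from a measured signal by linear interpolation.

    `standards` holds (concentration, signal) pairs from known standards,
    with signal assumed to increase with concentration.
    """
    points = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in points]
    if not signals[0] <= signal <= signals[-1]:
        raise ValueError("signal outside the range of the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return points[i][0]
    (c0, s0), (c1, s1) = points[i - 1], points[i]
    return c0 + (c1 - c0) * (signal - s0) / (s1 - s0)


# Hypothetical standards: (concentration in ng/mL, absorbance).
standards = [(0.0, 0.05), (10.0, 0.25), (20.0, 0.45), (40.0, 0.85)]
print(concentration_from_signal(0.35, standards))
```

Signals outside the range of the standards are rejected rather
than extrapolated, since the curve is only defined by the measured
standards.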

A competition assay can also be employed wherein antibodies
specific to CSG are attached to a solid support and labeled CSG
and a sample derived from the host are passed over the solid
support. The amount of label detected which is attached to the
solid support can be correlated to a quantity of CSG in the
sample.

Nucleic acid methods can also be used to detect CSG mRNA as a
marker for prostate cancer. Polymerase chain reaction (PCR) and
other nucleic acid methods, such as ligase chain reaction (LCR)
and nucleic acid sequence based amplification (NASBA), can be used
to detect malignant cells for diagnosis and monitoring of various
malignancies. For example, reverse-transcriptase PCR (RT-PCR) is a
powerful technique which can be used to detect the presence of a
specific mRNA population in a complex mixture of thousands of
other mRNA species. In RT-PCR, an mRNA species is first reverse
transcribed to complementary DNA (cDNA) with use of the enzyme
reverse transcriptase; the cDNA is then amplified as in a standard
PCR reaction. RT-PCR can thus reveal by amplification the presence
of a single species of mRNA. Accordingly, if the mRNA is highly
specific for the cell that produces it, RT-PCR can be used to
identify the presence of a specific type of cell.

Hybridization to clones or oligonucleotides arrayed on a solid
support (i.e. gridding) can be used both to detect the expression
of a gene and to quantitate its level of expression. In this
approach, a cDNA encoding the CSG gene is fixed to a substrate.
The substrate may be of any suitable type including but not
limited to glass, nitrocellulose, nylon or plastic. At least a
portion of the DNA encoding the CSG gene is attached to the
substrate and then incubated with the analyte, which may be RNA or
a complementary DNA (cDNA) copy of the RNA, isolated from the
tissue of interest. Hybridization between the substrate bound DNA
and the analyte can be detected and quantitated by several means
including but not limited to radioactive labeling or fluorescence
labeling of the analyte or a secondary molecule designed to detect
the hybrid. Quantitation of the level of gene expression can be
done by comparison of the intensity of the signal from the analyte
with that determined from known standards. The standards can be
obtained by in vitro transcription of the target gene,
quantitating the yield, and then using that material to generate a
standard curve.

Of the proteomic approaches, 2D electrophoresis is a technique
well known to those in the art. Isolation of individual proteins
from a sample such as serum is accomplished using sequential
separation of proteins by different characteristics, usually on
polyacrylamide gels. First, proteins are separated by size using
an electric current. The current acts uniformly on all proteins,
so smaller proteins move farther on the gel than larger proteins.
The second dimension applies a current perpendicular to the first
and separates proteins not on the basis of size but on the
specific electric charge carried by each protein. Since no two
proteins with different sequences are identical on the basis of
both size and charge, the result of a 2D separation is a square
gel in which each protein occupies a unique spot. Analysis of the
spots with chemical or antibody probes, or subsequent protein
microsequencing, can reveal the relative abundance of a given
protein and the identity of the proteins in the sample.

The above tests can be carried out on samples derived from a
variety of cells, bodily fluids and/or tissue extracts such as
homogenates or solubilized tissue obtained from a patient. Tissue
extracts are obtained routinely from tissue biopsy and autopsy
material. Bodily fluids useful in the present invention include
blood, urine, saliva or any other bodily secretion or derivative
thereof. By blood it is meant to include whole blood, plasma,
serum or any derivative of blood.

In Vivo Targeting of CSGs

Identification of these CSGs is also useful in the rational design
of new therapeutics for imaging and treating cancers, and in
particular prostate cancer. For example, in one embodiment,
antibodies which specifically bind to CSG can be raised and used
in vivo in patients suspected of suffering from prostate cancer.
Antibodies which specifically bind a CSG can be injected into a
patient suspected of having prostate cancer for diagnostic and/or
therapeutic purposes. The preparation and use of antibodies for in
vivo diagnosis is well known in the art. For example,
antibody-chelators labeled with Indium-111 have been described for
use in the radioimmunoscintigraphic imaging of carcinoembryonic
antigen expressing tumors (Sumerdon et al. Nucl. Med. Biol. 1990
17:247-254). In particular, these antibody-chelators have been
used in detecting tumors in patients suspected of having recurrent
colorectal cancer (Griffin et al. J. Clin. Onc. 1991 9:631-640).
Antibodies with paramagnetic ions as labels for use in magnetic
resonance imaging have also been described (Lauffer, R.B. Magnetic
Resonance in Medicine 1991 22:339-342). Antibodies directed
against CSG can be used in a similar manner. Labeled antibodies
which specifically bind CSG can be injected into patients
suspected of having prostate cancer for the purpose of diagnosing
or staging of the disease status of the patient. The label used
will be selected in accordance with the imaging modality to be
used. For example, radioactive labels such as Indium-111,
Technetium-99m or Iodine-131 can be used for planar scans or
single photon emission computed tomography (SPECT). Positron
emitting labels such as Fluorine-18 can be used in positron
emission tomography. Paramagnetic ions such as Gadolinium (III) or
Manganese (II) can be used in magnetic resonance imaging (MRI).
Localization of the label permits determination of the spread of
the cancer. The amount of label within an organ or tissue also
allows determination of the presence or absence of cancer in that
organ or tissue.

For patients diagnosed with prostate cancer, injection of an
antibody which specifically binds CSG can also have a therapeutic
benefit. The antibody may exert its therapeutic effect alone.
Alternatively, the antibody can be conjugated to a cytotoxic agent
such as a drug, toxin or radionuclide to enhance its therapeutic
effect. Drug-conjugated monoclonal antibodies have been described
in the art, for example by Garnett and Baldwin, Cancer Research
1986 46:2407-2412. The use of toxins conjugated to monoclonal
antibodies for the therapy of various cancers has also been
described by Pastan et al. Cell 1986 47:641-648. Yttrium-90
labeled monoclonal antibodies have been described for maximization
of dose delivered to the tumor while limiting toxicity to normal
tissues (Goodwin and Meares Cancer Supplement 1997 80:2675-2680).
Other cytotoxic radionuclides including, but not limited to,
Copper-67, Iodine-131 and Rhenium-186 can also be used for
labeling of antibodies against CSG.

Antibodies which can be used in these in vivo methods include
polyclonal, monoclonal and omniclonal antibodies and antibodies
prepared via molecular biology techniques. Antibody fragments and
aptamers and single-stranded oligonucleotides such as those
derived from an in vitro evolution protocol referred to as SELEX
and well known to those skilled in the art can also be used.

Small molecules predicted via computer imaging to specifically
bind to regions of CSGs can also be designed and synthesized and
tested for use in the imaging and treatment of prostate cancer.
Further, libraries of molecules can be screened for potential
anticancer agents by assessing the ability of the molecule to bind
to CSGs identified herein. Molecules identified in the library as
being capable of binding to CSG are key candidates for further
evaluation for use in the treatment of prostate cancer.

EXAMPLES

The present invention is further described by the following
examples. These examples are provided solely to illustrate the
invention by reference to specific embodiments. These
exemplifications, while illustrating certain aspects of the
invention, do not portray the limitations or circumscribe the
scope of the disclosed invention.

All examples outlined here were carried out using standard
techniques, which are well known and routine to those of skill in
the art, except where otherwise described in detail. Routine
molecular biology techniques of the following example can be
carried out as described in standard laboratory manuals, such as
Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Ed.;
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
(1989).

Example 1: Identification of CSGs 

Identification of CSGs was carried out by a systematic analysis of
data in the LIFESEQ database available from Incyte
Pharmaceuticals, Palo Alto, CA, using the data mining Cancer Leads
Automatic Search Package (CLASP) developed by diaDexus LLC, Santa
Clara, CA.

The CLASP performs the following steps: selection of 
highly expressed organ specific genes based on the abundance 
level of the corresponding EST in the targeted organ versus 
25 all the other organs; analysis of the expression level of each 
highly expressed organ specific genes in normal, tumor tissue, 
disease tissue and tissue libraries associated with tumor or 
disease; selection of the candidates demonstrating component 
ESTs were exclusively or more frequently found in tumor 
30 libraries. The CLASP allows the identification of highly 
expressed organ and cancer specific genes. A final manual in 
depth evaluation is then performed to finalize the CSGs 
selection . 
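
The selection steps listed above can be illustrated with a minimal sketch. The specification does not disclose CLASP's actual scoring, thresholds or data model, so everything in the block below is a hypothetical stand-in: the GeneCounts record, the min_organ_ests and min_organ_fraction cutoffs, and the use of raw EST counts without normalizing for library size.

```python
from dataclasses import dataclass


@dataclass
class GeneCounts:
    """Hypothetical per-gene EST tallies (not the LIFESEQ schema)."""
    gene: str
    organ_ests: int    # ESTs seen in libraries of the targeted organ
    other_ests: int    # ESTs seen in libraries of all other organs
    tumor_ests: int    # targeted-organ ESTs from tumor libraries
    normal_ests: int   # targeted-organ ESTs from normal libraries


def select_candidates(genes, min_organ_ests=5, min_organ_fraction=0.8):
    """Return names of genes passing both illustrative filters.

    Filter 1: highly expressed and organ specific (assumed cutoffs).
    Filter 2: ESTs found more frequently in tumor than normal libraries.
    """
    picked = []
    for g in genes:
        total = g.organ_ests + g.other_ests
        if total == 0 or g.organ_ests < min_organ_ests:
            continue  # not abundant enough in the targeted organ
        if g.organ_ests / total < min_organ_fraction:
            continue  # not specific enough to the targeted organ
        if g.tumor_ests > g.normal_ests:
            picked.append(g.gene)
    return picked


example = [
    GeneCounts("geneA", 20, 1, 15, 5),    # organ specific, tumor enriched
    GeneCounts("geneB", 20, 30, 15, 5),   # abundant but not organ specific
    GeneCounts("geneC", 20, 0, 2, 18),    # organ specific, not tumor enriched
    GeneCounts("geneD", 2, 0, 2, 0),      # too few ESTs
]
print(select_candidates(example))
```

Candidates returned by such a filter would still be subject to the manual in-depth evaluation described above.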




Clones depicted in the following Table 1 are CSGs useful
in diagnosing, monitoring, staging, imaging and treating
prostate cancer.



Table 1: CSGs

Clone ID      Pro #     SEQ ID NO:
3424528H1     Pro109    1, 2
578349H1      Pro112    3, 4
1794013H1     Pro111    5, 6
2189835H1     Pro115    7, 8
3277219H1     Pro110    9, 10
1857415       Pro113    11, 12
1810463H1     Pro114    13, 14
zr65G11       Pro118    15, 16
2626135H1               17
zd46d08                 18
1712252H1               19
784583H1                20



Example 2: Relative Quantitation of Gene Expression

Real-Time quantitative PCR with fluorescent Taqman probes
is a quantitation detection system utilizing the 5'-3'
nuclease activity of Taq DNA polymerase. The method uses an
internal fluorescent oligonucleotide probe (Taqman) labeled
with a 5' reporter dye and a downstream, 3' quencher dye.
During PCR, the 5'-3' nuclease activity of Taq DNA polymerase
releases the reporter, whose fluorescence can then be detected
by the laser detector of the Model 7700 Sequence Detection
System (PE Applied Biosystems, Foster City, CA, USA).

Amplification of an endogenous control is used to
standardize the amount of sample RNA added to the reaction and
normalize for Reverse Transcriptase (RT) efficiency. Either
cyclophilin, glyceraldehyde-3-phosphate dehydrogenase (GAPDH),
ATPase, or 18S ribosomal RNA (rRNA) is used as this endogenous
control. To calculate relative quantitation between all the
samples studied, the target RNA levels for one sample were
used as the basis for comparative results (calibrator).
Quantitation relative to the "calibrator" can be obtained
using the standard curve method or the comparative method
(User Bulletin #2: ABI PRISM 7700 Sequence Detection System).
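
As an illustration of the comparative method, the relative level of a target can be computed from threshold cycle (Ct) values as 2 raised to the power of minus delta-delta-Ct. The sketch below assumes roughly 100% amplification efficiency for both target and endogenous control; the function name and the example Ct values are hypothetical and are not data from this specification.

```python
def relative_quantity(ct_target_sample, ct_control_sample,
                      ct_target_calibrator, ct_control_calibrator):
    """Comparative Ct calculation (assumes ~100% PCR efficiency).

    delta Ct       = Ct(target) - Ct(endogenous control), per sample
    delta-delta Ct = delta Ct(sample) - delta Ct(calibrator)
    relative level = 2 ** (-delta-delta Ct)
    """
    delta_sample = ct_target_sample - ct_control_sample
    delta_calibrator = ct_target_calibrator - ct_control_calibrator
    return 2.0 ** -(delta_sample - delta_calibrator)


# Hypothetical Ct values: the sample's target crosses threshold three
# cycles earlier than the calibrator's after normalization.
level = relative_quantity(25.0, 18.0, 28.0, 18.0)
print(level)
```

By construction the calibrator itself always has a relative level of 1, which is why the calibrator tissue appears as 1 in the tables of this example.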

The tissue distribution and the level of the target gene
were evaluated for every sample in normal and cancer tissues.
Total RNA was extracted from normal tissues, cancer tissues,
and from cancers and the corresponding matched adjacent
tissues. Subsequently, first strand cDNA was prepared with
reverse transcriptase and the polymerase chain reaction was
done using primers and Taqman probes specific to each target
gene. The results were analyzed using the ABI PRISM 7700
Sequence Detector. The absolute numbers are relative levels
of expression of the target gene in a particular tissue
compared to the calibrator tissue.



Expression of Clone ID 3424528H1 (Pro109):

For the CSG Pro109, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- ATCAGAACAAAGAGGCTGTGTC - 3' (SEQ ID NO: 21)
Reverse Primer:
5'- ATCTCTAAAGCCCCAACCTTC - 3' (SEQ ID NO: 22)
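
Basic properties of a primer pair such as the one above can be checked with a few lines of code. The sketch below is only a rough sanity check and is not part of the disclosed method: the helper names are hypothetical, and the melting temperature uses the simple 2(A+T) + 4(G+C) rule of thumb, which is a coarse approximation for oligonucleotides of this length.

```python
def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in degrees C: 2(A+T) + 4(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Primer sequences as given for Pro109 (SEQ ID NO: 21 and 22).
PRIMERS = {
    "forward": "ATCAGAACAAAGAGGCTGTGTC",
    "reverse": "ATCTCTAAAGCCCCAACCTTC",
}

for name, seq in PRIMERS.items():
    print(name, len(seq), f"GC={gc_fraction(seq):.0%}", f"Tm~{wallace_tm(seq)}")
```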

The absolute numbers depicted in Table 2 are relative levels
of expression of the CSG referred to as Pro109 in 12 different
normal tissues. All the values are compared to normal
stomach (calibrator). These RNA samples are commercially
available pools, obtained by pooling samples of a particular
tissue from different individuals.

Table 2: Relative Levels of CSG Pro109 Expression in Pooled
Samples

Tissue             NORMAL
Colon              [illegible]
Endometrium        [illegible]
Kidney             0.48
Liver              14.83
Ovary              0.08
Pancreas           4.38
Prostate           11.24
Small Intestine    0.42
Spleen             0
Stomach            1
Testis             0.62
Uterus             0.02



The relative levels of expression in Table 2 show that with
the exception of liver (14.83), Pro109 mRNA expression is
higher (11.24) in prostate compared with all other normal
tissues analyzed. Pancreas, with a relative expression level
of 4.38, is the only other tissue expressing considerable mRNA
for Pro109.

The absolute numbers in Table 2 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
originated from RNA obtained from tissue samples of a single
individual in Table 3.

The absolute numbers depicted in Table 3 are relative
levels of expression of Pro109 in 28 pairs of matching samples
and 4 unmatched samples. All the values are compared to
normal stomach (calibrator). A matching pair is formed by
mRNA from the cancer sample for a particular tissue and mRNA
from the normal adjacent sample for that same tissue from the
same individual.

Table 3: Relative Levels of CSG Pro109 Expression in
Individual Samples

Sample ID     Tissue             Cancer        Matching Normal Adjacent
Pro34B        Prostate 1         5.98          6.06
Pro65XB       Prostate 2         16.68         3.85
Pro69XB       Prostate 3         20.46         6.82
Pro78XB       Prostate 4         1.39          1.4
Pro101XB      Prostate 5         24.8          9.8
Pro12B        Prostate 6         9.1           0.2
Pro13XB       Prostate 7         0.5           9.7
Pro20XB       Prostate 8         13            12.5
Pro23B        Prostate 9         16.8          3
Ovr10005O     Ovary 1            0.4
Ovr1028       Ovary 2            1.9
Ovr18GA       Ovary 3                          0.1
Ovr206I       Ovary 4                          0.1
Mam12X        Mammary Gland 1    13.5          1.4
Mam47XP       Mammary Gland 2    0.7           0.2
Lng47XQ       Lung 1             2.36          0.03
Lng60XL       Lung 2             7.39          0.2
Lng75XC       Lung 3             0.77          0.27
StoAC44       Stomach 1          0.05          1.19
StoAC93       Stomach 2          0.55          0.8
StoAC99       Stomach 3          0.12          3.04
ColAS43       Colon 1            16.11         0.07
ColAS45       Colon 2            0.11          0.08
ColAS46       Colon 3            4.99          0.4
Liv15XA       Liver 1            8.43          10.97
Liv42X        Liver 2            1.57          20.82
Liv94XA       Liver 3            [illegible]   [illegible]
[illegible]   Pancreas 1         36            32
Pan82XP       Pancreas 2         0.09          7.09
Pan92X        Pancreas 3         0.7           0
Pan71XL       Pancreas 4         2.48          0.73
Pan10343      Pancreas 5         46            5.5

0 = Negative



In the analysis of matching samples, the higher levels
of expression were in prostate, showing a high degree of
tissue specificity for prostate tissue. Of all the
non-prostate samples analyzed, only 4 cancer samples (mammary
1 with 13.5, colon 1 with 16.11, liver 1 with 8.43, and lung 2
with 7.39) showed an expression comparable to the mRNA
expression in prostate. These results confirmed some degree
of tissue specificity as obtained with the panel of normal
pooled samples (Table 2).

Furthermore, the level of mRNA expression was compared
in cancer samples and the isogenic normal adjacent tissue from
the same individual. This comparison provides an indication
of specificity for the cancer (e.g. higher levels of mRNA
expression in the cancer sample compared to the normal
adjacent). Table 3 shows overexpression of Pro109 in 6 out
of 9 primary prostate cancer tissues compared with their
respective normal adjacents. Thus, overexpression in the
cancer tissue was observed in 66.66% of the prostate matching
samples tested (total of 9 prostate matching samples).
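
The 6-of-9 tally can be reproduced from the prostate rows of Table 3 with a short calculation. This is a sketch under stated assumptions: the specification does not give a numerical cutoff for "overexpression", so the code simply counts pairs where the cancer value exceeds the normal adjacent value, and several normal adjacent values are read through extraction damage in the table.

```python
# (cancer, matching normal adjacent) relative levels for the nine
# prostate pairs of Table 3; some values are readings of damaged cells.
PRO109_PROSTATE_PAIRS = {
    "Prostate 1": (5.98, 6.06),
    "Prostate 2": (16.68, 3.85),
    "Prostate 3": (20.46, 6.82),
    "Prostate 4": (1.39, 1.4),
    "Prostate 5": (24.8, 9.8),
    "Prostate 6": (9.1, 0.2),
    "Prostate 7": (0.5, 9.7),
    "Prostate 8": (13.0, 12.5),
    "Prostate 9": (16.8, 3.0),
}


def overexpressed_pairs(pairs):
    """Names of pairs whose cancer value exceeds the normal adjacent value."""
    return [name for name, (cancer, normal) in pairs.items() if cancer > normal]


hits = overexpressed_pairs(PRO109_PROSTATE_PAIRS)
fraction = len(hits) / len(PRO109_PROSTATE_PAIRS)
print(f"{len(hits)} of {len(PRO109_PROSTATE_PAIRS)} pairs ({fraction:.2%})")
```

Under this simple rule, Prostate 8 (13 versus 12.5) counts as overexpressed even though the difference is small; a stricter fold-change cutoff would give a lower tally.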

Altogether, the degree of tissue specificity, plus the
mRNA overexpression in 66.66% of the primary prostate matching
samples tested, is indicative of Pro109 being a diagnostic
marker for prostate cancer.

Expression of Clone ID 578349H1 (Pro112):

For the CSG Pro112, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- TGCCGAAGAGGTTCAGTGC - 3' (SEQ ID NO: 23)
Reverse Primer:
5'- GCCACAGTGGTACTGTCCAGAT - 3' (SEQ ID NO: 24)

The absolute numbers depicted in Table 4 are relative
levels of expression of the CSG Pro112 in 12 different normal
tissues. All the values are compared to normal thymus
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 4: Relative Levels of CSG Pro112 Expression in Pooled
Samples

Tissue             NORMAL
Brain              2.9
Heart              0.1
Kidney             0.2
Liver              0.2
Lung               7.7
Mammary            4.2
Muscle             0.1
Prostate           5.5
Small Intestine    1.8
Testis             1
Thymus             1
Uterus             21



The relative levels of expression in Table 4 show that
Pro112 mRNA expression in the pool of normal prostate tissue
is the third highest (after uterus and mammary) among the
12 tissues analyzed. The absolute numbers in Table 4 were
obtained analyzing pools of samples of a particular tissue
from different individuals. These results demonstrate that
Pro112 mRNA expression is specific for prostate, thus
indicating Pro112 to be a diagnostic marker for prostate
disease, especially cancer.

Expression of Clone ID 1794013H1 (Pro111):

For the CSG Pro111, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- GCTGCAAGTTCTCCACATTGA - 3' (SEQ ID NO: 25)
Reverse Primer:
5'- CAGCCGCAGGTGAAACAC - 3' (SEQ ID NO: 26)

The absolute numbers depicted in Table 5 are relative levels
of expression of the CSG Pro111 in 12 different normal
tissues. All the values are compared to normal testis
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 5: Relative Levels of CSG Pro111 Expression in Pooled
Samples

Tissue             NORMAL
Brain              0.04
Heart              0
Kidney             0
Liver              0
Lung               0.05
Mammary            0.14
Muscle             5166.6
Prostate           1483.72
Small Intestine    0.33
Testis             1
Thymus             0.49
Uterus             0.07



The relative levels of expression in Table 5 show that Pro111
mRNA expression is extraordinarily high in the pool of normal
prostate (1483.72) compared to all the other tissues analyzed
with the exception of muscle (5166.6). These results
demonstrate that Pro111 mRNA expression shows specificity for
prostate and muscle.
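
Because every value in a pooled-sample table is a ratio to the same calibrator, two tissues can be compared directly by dividing their values, which amounts to re-expressing the table against a different calibrator. The sketch below does this for the Table 5 values; the rebase helper is hypothetical and is offered only as arithmetic illustration, not as part of the disclosed analysis.

```python
# Relative levels of Pro111 in pooled normal tissues (Table 5),
# all expressed against normal testis (calibrator = 1).
PRO111_POOLED = {
    "Brain": 0.04, "Heart": 0.0, "Kidney": 0.0, "Liver": 0.0,
    "Lung": 0.05, "Mammary": 0.14, "Muscle": 5166.6,
    "Prostate": 1483.72, "Small Intestine": 0.33, "Testis": 1.0,
    "Thymus": 0.49, "Uterus": 0.07,
}


def rebase(levels, new_calibrator):
    """Re-express relative levels against another tissue of the same table."""
    reference = levels[new_calibrator]
    if reference == 0:
        raise ValueError("cannot use a negative (0) tissue as calibrator")
    return {tissue: value / reference for tissue, value in levels.items()}


by_prostate = rebase(PRO111_POOLED, "Prostate")
print(round(by_prostate["Muscle"], 2))   # muscle relative to prostate
```

This only holds within one table; as stated in the text, pooled-sample values and individual-sample values are not comparable with each other.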
The absolute numbers in Table 5 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
originated from RNA obtained from tissue samples of a single
individual in Table 6.

The absolute numbers depicted in Table 6 are relative
levels of expression of Pro111 in 48 pairs of matching and 18
unmatched samples. All the values are compared to normal
testis (calibrator). A matching pair is formed by mRNA from
the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same
individual.

Table 6: Relative Levels of CSG Pro111 Expression in
Individual Samples

Sample ID     Tissue                      Cancer        Matching Normal Adjacent
Pro101XB      Prostate 1                  8.3           21.8
Pro12B        Prostate 2                  2336          133
Pro13XB       Prostate 3                  3.4           23
Pro20XB       Prostate 4                  21.6          121.5
Pro23B        Prostate 5                  19.4          3.7
Pro34B        Prostate 6                  15            39
Pro65XB       Prostate 7                  8             867
Pro69XB       Prostate 8                  56            94
Pro78XB       Prostate 9                  24            1515
Pro84XB       Prostate 10                 119           15.35
Pro90XB       Prostate 11                 8.08          112.2
Pro91XB       Prostate 12                 0.88          51.8
ProC215       Prostate 13                 0.3
ProC234       Prostate 14                 0.35
ProC280       Prostate 15                 436.5
Pro109XB      Prostate 16                 3.43          265
Pro110        Prostate 17                 18.2          8.73
[illegible]   Prostate 18                 0.34          186
[illegible]   Prostate 19                 1392          110
Pro10R        Prostate 20 (prostatitis)   0.5
Pro20R        Prostate 21 (prostatitis)   24.1
Pro258        Prostate 22 (BPH)           [illegible]
Pro263C       Prostate 23 (BPH)           [illegible]
Pro267A       Prostate 24 (BPH)           [illegible]
Pro271A       Prostate 25 (BPH)           [illegible]
Pro460Z       Prostate 26 (BPH)           [illegible]
ProC032       Prostate 27 (BPH)           14.4
Tst39X        Testis 1                    [illegible]   [illegible]
Bld32XK       Bladder 1                   [illegible]   [illegible]
Bld46XK       Bladder 2                   [illegible]   [illegible]
Bld66X        Bladder 3                   [illegible]   [illegible]
BldTR14       Bladder 4                   [illegible]   [illegible]
Kid106XD      Kidney 1                    [illegible]   [illegible]
Kid107XD      Kidney 2                    [illegible]   [illegible]
Kid109XD      Kidney 3                    [illegible]   [illegible]
Pan10343      Pancreas 1                  [illegible]   [illegible]
Pan71XL       Pancreas 2                  0             [illegible]
Pan77X        Pancreas 3                  0             [illegible]
Liv15XA       Liver 1                     [illegible]   [illegible]
Liv42X        Liver 2                     [illegible]   [illegible]
ClnAS43       Colon 1                     [illegible]   [illegible]
ClnAS45       Colon 2                     [illegible]   0
ClnAS46       Colon 3                     0             0
ClnAS67       Colon 4                     0             0
ClnAC19       Colon 5                     0             0
ClnAS12       Colon 6                     0             0
SmI21XA       Small Intestine 1           [illegible]   [illegible]
SmIH89        Small Intestine 2           [illegible]   [illegible]
Lng47XQ       Lung 1                      [illegible]   [illegible]
Lng60XL       Lung 2                      [illegible]   [illegible]
Lng75XC       Lung 3                                    [illegible]
Lng90X        Lung 4                      [illegible]   [illegible]
Mam12X        Mammary Gland 1             [illegible]   1.4
Mam59X        Mammary Gland 2                           [illegible]
MamA06X       Mammary Gland 3             [illegible]   [illegible]
MamS127       Mammary Gland 4             [illegible]   [illegible]
Mam162X       Mammary Gland 5             [illegible]   0
Mam42DN       Mammary Gland 6             [illegible]   0
Ovr103X       Ovary 1                     [illegible]
Ovr1005O      Ovary 2                     [illegible]
Ovr1028       Ovary 3
Ovr1040O      Ovary 4                     0.2
Ovr18GA       Ovary 5                                   0
Ovr206I       Ovary 6                                   0
Ovr20GA       Ovary 7                                   0.2
Ovr25GA       Ovary 8                                   0

0 = Negative
[illegible] = value or identifier not recoverable from the source copy



In the analysis of matching samples, the higher levels
of expression were in prostate, showing a high degree of tissue
specificity for prostate. These results confirm the tissue
specificity results obtained with normal pooled samples (Table
5).

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same
individual were compared. This comparison provides an
indication of specificity for cancer (e.g. higher levels of
mRNA expression in the cancer sample compared to the normal
adjacent). Table 6 shows overexpression of Pro111 in 5 out
of 16 primary prostate cancer samples compared with their
respective normal adjacent (prostate samples 2, 5, 10, 17, and
19). Similar expression levels were observed in 3 unmatched
prostate cancers (prostate samples 13, 14, 15), 2 prostatitis
(prostate samples 20, 21), and 6 benign prostatic hyperplasia
samples (prostate samples 22 through 27). Thus, there is
overexpression in the cancer tissue of 31.25% of the prostate
matching samples tested (total of 16 prostate matching
samples).

Altogether, the high level of tissue specificity, plus
the mRNA overexpression in 31.25% of the prostate matching
samples tested, is indicative of Pro111 being a diagnostic
marker for prostate cancer.

Expression of Clone ID 2189835H1 (Pro115):

For the CSG Pro115, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- TGGCTTTGAACTCAGGGTCA - 3' (SEQ ID NO: 27)
Reverse Primer:
5'- CGGATGCACCTCGTAGACAG - 3' (SEQ ID NO: 28)

The absolute numbers depicted in Table 7 are relative levels
of expression of the CSG Pro115 in 12 different normal
tissues. All the values are compared to normal thymus
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 7: Relative Levels of CSG Pro115 Expression in Pooled
Samples

Tissue             NORMAL
Brain              0.016
Heart              0.002
Kidney             8.08
Liver              2.20
Lung               112.99
Mammary            29.45
Muscle             0.05
Prostate           337.79
Small Intestine    7.54
Testis             1.48
Thymus             1
Uterus             1.4



The relative levels of expression in Table 7 show that
Pro115 mRNA expression is higher (337.79) in prostate compared
with all the other normal tissues analyzed. Lung, with a
relative expression level of 112.99, and mammary (29.446) are
the other tissues expressing moderate levels of mRNA for
Pro115. These results establish Pro115 mRNA expression to be
highly specific for prostate.

The absolute numbers in Table 7 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
originated from RNA obtained from tissue samples of a single
individual in Table 8.

The absolute numbers depicted in Table 8 are relative
levels of expression of Pro115 in 17 pairs of matching and 21
unmatched samples. All the values are compared to normal
thymus (calibrator). A matching pair is formed by mRNA from
the cancer sample for a particular tissue and mRNA from the
normal adjacent sample for that same tissue from the same
individual.

Table 8: Relative Levels of CSG Pro115 Expression in
Individual Samples

Sample ID     Tissue                      Cancer        Matching Normal Adjacent
Pro12B        Prostate 1                  1475.9        190.3
ProC234       Prostate 2                  169.61
Pro109XB      Prostate 3                                639.53
Pro101XB      Prostate 4                  1985.2        2882.9
Pro13XB       Prostate 5                  [illegible]   [illegible]
Pro215        Prostate 6                  [illegible]
Pro125XB      Prostate 7                                556.05
Pro23B        Prostate 8                  1891.4        1118.6
ProC280       Prostate 9                  [illegible]
Pro20XB       Prostate 10                 1332.6
Pro34B        Prostate 11                               362.91
Pro65XB       Prostate 12                               135.06
Pro69XB       Prostate 13                               179.67
Pro10R        Prostate 14 (prostatitis)   143.82
Pro20R        Prostate 15 (prostatitis)   397.79
Pro258        Prostate 16 (BPH)           [illegible]
Pro263C       Prostate 17 (BPH)           [illegible]
Pro267A       Prostate 18 (BPH)           [illegible]
Pro271A       Prostate 19 (BPH)           [illegible]
Pro460Z       Prostate 20 (BPH)           [illegible]
ProC032       Prostate 21 (BPH)           [illegible]
SmI21XA       Small Intestine 1           [illegible]   [illegible]
SmIH89        Small Intestine 2           [illegible]   [illegible]
ClnAC19       Colon 1                     [illegible]   [illegible]
ClnAS12       Colon 2                     116.97        [illegible]
Kid106XD      Kidney 1                    [illegible]   [illegible]
Kid107XD      Kidney 2                    [illegible]   [illegible]
Lng47XQ       Lung 1                      [illegible]   [illegible]
Lng60XL       Lung 2                      [illegible]   114.78
Lng75XC       Lung 3                      16.47         53.79
Mam12X        Mammary Gland 1             6.25          10.75
Mam162X       Mammary Gland 2             1.84          2.54
Mam42DN       Mammary Gland 3             23.08         35.51
Ovr1005O      Ovary 1                     0.9
Ovr1028       Ovary 2                     261.4
Ovr103X       Ovary 3                     7             0.1
Ovr20GA       Ovary 4                                   0
Ovr25GA       Ovary 5                                   0

0 = Negative
[illegible] = value not recoverable from the source copy



Higher levels of expression were seen in prostate,
showing a high degree of tissue specificity for prostate
tissue. Of all the non-prostate samples analyzed,
only two cancer samples (colon 2 with 116.97 and ovary 2 with
261.4), and 5 normal adjacent tissue samples (small intestine
2, colon 1, colon 2, kidney 1, and lung 2), showed an
expression comparable to the mRNA expression in prostate.
These results confirmed the tissue specificity results
obtained with the panel of normal pooled samples (Table 7).

Furthermore, the levels of mRNA expression in cancer
samples and the isogenic normal adjacent tissue from the same
individual were compared. This comparison provides an
indication of specificity for the cancer (e.g. higher levels
of mRNA expression in the cancer sample compared to the normal
adjacent). Table 8 shows higher expression of Pro115 in 3 out
of 4 matched prostate cancer tissues (prostate samples 1, 5
& 8).

Altogether, the high level of tissue specificity, plus
the higher expression in 75% of the prostate matching samples
tested, is indicative of Pro115 being a diagnostic marker for
prostate cancer.

Expression of Clone ID 3277219H1 (Pro110):

For the CSG Pro110, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- CGGCAACCTGGTAGTGAGTG - 3' (SEQ ID NO: 29)
Reverse Primer:
5'- CGCAGCTCCTTGTAAACTTCAG - 3' (SEQ ID NO: 30)

The absolute numbers depicted in Table 9 are relative levels
of expression of the CSG Pro110 in 12 different normal
tissues. All the values are compared to normal small
intestine (calibrator). These RNA samples are commercially
available pools, obtained by pooling samples of a particular
tissue from different individuals.

Table 9: Relative Levels of CSG Pro110 Expression in Pooled
Samples

Tissue             NORMAL
Brain              6.61
Heart              0.7
Kidney             0.74
Liver              7.94
Lung               11.88
Mammary            22.78
Muscle             6.77
Prostate           3.01
Small Intestine    1
Testis             2.58
Thymus             13.74
Uterus             2.61



The relative levels of expression in Table 9 show that Pro110
mRNA expression is not particularly high in normal prostate
(3.01) compared with the other normal tissues analyzed.

The absolute numbers in Table 9 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
originated from RNA obtained from tissue samples of a single
individual in Table 10.

The absolute numbers depicted in Table 10 are relative
levels of expression of Pro110 in 33 pairs of matching
samples. All the values are compared to normal small
intestine (calibrator). A matching pair is formed by mRNA
from the cancer sample for a particular tissue and mRNA from
the normal adjacent sample for that same tissue from the same
individual.

Table 10: Relative Levels of CSG Pro110 Expression in
Individual Samples

Sample ID     Tissue               Cancer    Matching Normal Adjacent
Pro12B        Prostate 1           11.8      0.3
Pro78XB       Prostate 2           14.3      6.3
Pro101XB      Prostate 3           33.2      10.7
Pro13XB       Prostate 4           0.3       0.4
Pro23XB       Prostate 5           25.5      14.4
Pro20XB       Prostate 6           43.3      4
Pro34XB       Prostate 7           31.8      18.7
Pro65XB       Prostate 8           26.9      3.4
Pro69XB       Prostate 9           12.5      7
Lng75XC       Lung 1               1.9       3
Lng90X        Lung 2               5.5       0.5
LngAC11       Lung 3               9.3       9.7
LngAC32       Lung 4               11.2      2.2
Lng47XQ       Lung 5               11.3      0.3
Lng60XL       Lung 6               29.1      6.8
Mam12B        Mammary Gland 1      19.8      0
Mam603X       Mammary Gland 2      13.7      0
Mam82XI       Mammary Gland 3      73.5      0
MamA04        Mammary Gland 4      0         24.6
MamB011X      Mammary Gland 5      17.4      2
MamC012       Mammary Gland 6      0         12.8
MamC034       Mammary Gland 7      0         61
Mam12X        Mammary Gland 8      14        2.2
Mam59X        Mammary Gland 9      33        2.2
[illegible]   Mammary Gland 10     16.4      0.8
[illegible]   Liver 1              4.7       0.6
[illegible]   Liver 2              7.5       2.6
[illegible]   Liver 3              0.4       1.4
ClnAS43       Colon 1              52.9      1.4
ClnAS45       Colon 2              2.1       0.8
ClnAS46       Colon 3              39.8      3.7
SmI21X        Small Intestine 1    0.9       0.1
SmIH89        Small Intestine 2    5.8       0.9

0 = Negative
[illegible] = identifier not recoverable from the source copy

The levels of mRNA expression in cancer samples and the
isogenic normal adjacent tissue from the same individual were
compared. This comparison provides an indication of
specificity for the cancer (e.g. higher levels of mRNA
expression in the cancer sample compared to the normal
adjacent). Table 10 shows overexpression of Pro110 in 8 of
the 9 primary prostate cancer tissues compared with their
respective normal adjacent (except prostate 4). Thus, there
was overexpression in the cancer tissue in 88.88% of the
prostate matching samples tested (total of
9 prostate matching samples).
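
The size of the difference within each matched pair can be expressed as a cancer-to-normal fold change. The sketch below applies this to the nine prostate pairs of Table 10. It is illustrative only: the specification reports which pairs are overexpressed but not fold changes, and the handling of a normal value of 0 (reported as infinite) is an assumption made here.

```python
import math


def fold_change(cancer, normal):
    """Cancer/normal ratio of relative expression levels.

    When the normal adjacent value is 0 (negative), the ratio is
    undefined; this sketch reports infinity if the cancer value is
    positive and NaN if both are 0.
    """
    if normal == 0:
        return math.inf if cancer > 0 else math.nan
    return cancer / normal


# (cancer, matching normal adjacent) for Prostate 1 to 9 in Table 10.
PRO110_PROSTATE_PAIRS = [
    (11.8, 0.3), (14.3, 6.3), (33.2, 10.7), (0.3, 0.4), (25.5, 14.4),
    (43.3, 4.0), (31.8, 18.7), (26.9, 3.4), (12.5, 7.0),
]

folds = [fold_change(c, n) for c, n in PRO110_PROSTATE_PAIRS]
overexpressed = sum(f > 1 for f in folds)
print(overexpressed, "of", len(folds), "pairs have fold change above 1")
```

Only Prostate 4 (0.3 versus 0.4) falls below a fold change of 1, matching the exception noted in the text.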

Although not tissue specific, Pro110 mRNA expression is
upregulated in prostate cancer tissues. The mRNA
overexpression in 88.88% of the primary prostate matching
cancer samples tested is indicative of Pro110 being a
diagnostic marker for prostate cancer. Pro110 also showed
overexpression in several other cancers tested including small
intestine, colon, liver, mammary and lung (see Table 10).
Accordingly, Pro110 may be a diagnostic marker for other types
of cancer as well.

Expression of Clone ID 1857415; Gene ID 346880 (Pro113):

For the CSG Pro113, real-time quantitative PCR was
performed using the following primers:
Forward Primer:
5'- CGGGAACCTACCAGCCTATG - 3' (SEQ ID NO: 31)
Reverse Primer:
5'- CAGGCAACAGGGAGTCATGT - 3' (SEQ ID NO: 32)

The absolute numbers depicted in Table 11 are relative levels
of expression of the CSG Pro113 in 12 different normal
tissues. All the values are compared to normal thymus
(calibrator). These RNA samples are commercially available
pools, obtained by pooling samples of a particular tissue
from different individuals.

Table 11: Relative Levels of CSG Pro113 Expression in
Pooled Samples

Tissue             NORMAL
Brain              0.03
Heart              0
Kidney             0.01
Liver              0
Lung               0
Mammary Gland      0
Muscle             0.04
Prostate           489.44
Small Intestine    0.02
Testis             0.35
Thymus             1
Uterus             0.13



The relative levels of expression in Table 11 show that Pro113
mRNA expression is higher (489.44) in prostate compared with
all the other normal tissues analyzed. Testis, with a
relative expression level of 0.35, uterus (0.13), thymus
(1.0), kidney (0.01) and brain (0.03) were among the other
tissues expressing lower mRNA levels for Pro113. These
results establish that Pro113 mRNA expression is highly
specific for prostate.
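
One simple way to put a number on this kind of tissue specificity is the ratio between the target tissue and the highest-expressing other tissue. The sketch below computes it for the Table 11 values. The specificity_ratio helper and the choice of this particular measure are assumptions for illustration; the specification itself does not define a specificity score.

```python
# Relative levels of Pro113 in pooled normal tissues (Table 11),
# all expressed against normal thymus (calibrator = 1).
PRO113_POOLED = {
    "Brain": 0.03, "Heart": 0.0, "Kidney": 0.01, "Liver": 0.0,
    "Lung": 0.0, "Mammary Gland": 0.0, "Muscle": 0.04,
    "Prostate": 489.44, "Small Intestine": 0.02, "Testis": 0.35,
    "Thymus": 1.0, "Uterus": 0.13,
}


def specificity_ratio(levels, target):
    """Return (runner-up tissue, target level / runner-up level)."""
    others = {t: v for t, v in levels.items() if t != target}
    runner_up = max(others, key=others.get)
    top_other = others[runner_up]
    ratio = levels[target] / top_other if top_other > 0 else float("inf")
    return runner_up, ratio


runner_up, ratio = specificity_ratio(PRO113_POOLED, "Prostate")
print(runner_up, ratio)
```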




The absolute numbers in Table 11 were obtained analyzing
pools of samples of a particular tissue from different
individuals. They cannot be compared to the absolute numbers
originated from RNA obtained from tissue samples of a single
individual in Table 12.

The absolute numbers depicted in Table 12 are relative
levels of expression of Pro113 in 78 pairs of matching and 25
unmatched tissue samples. All the values are compared to
normal thymus (calibrator). A matching pair is formed by mRNA
from the cancer sample for a particular tissue and mRNA from
the normal adjacent sample for that same tissue from the same
individual. In cancers (for example, ovary) where it was not
possible to obtain normal adjacent samples from the same
individual, samples from a different normal individual were
analyzed.

Table 12: Relative Levels of CSG Proll3 Expression in 



Individual Samples 



Sample ID 


Tissue 


Cancer 


Matched or 
Unmatched 

Normal 
Adjacent 


Pro780B/781B 


Prostate 1 


375.58 


446.29 


Prol291B/1292B 


Prostate 2 


1060 


31 


Prol39B96/140B96 


Prostate 3 


41 


32 


Pro209B96/210B96 


Prostate 4 


505 


255 


Prpl256B/1257B 


Prostate 5 


165.79 


141.63 


Prol293B/1294B 


Prostate 6 


1613.7 


874 . 61 


Pro694B/695B 


Prostate 7 


458 . 6 


142.21 


Prol012B/1013B 


Prostate 8 


1520 


864 


Prol222B/1223B 


Prostate 9 


939 


530 


Pro845B/846B 


Prostate 10 


1552 . 4 


374.6 


Prol094B/1095B 


Prostate 11 


278 .37 


135.89 


Pro650B/651B 


Prostate 12 


532.81 


640 . 85 
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| Pro902B/903B 


Prostate 13 


609.05 


415.8 6 - 


| Pro916B/917B 


Prostate 14 


699.42 


1*. 

J 401.24 | 


| Pro9821110A/110B 


Prostate 15 


156 


| 487.8 | 


| ProS9821326A/26B 


Prostate 16 


744 . 4 


| 472.8 | 


1 Jrroy^u/c^lo 


Prostate 17 


1389.2 




1 Pro9407c234 


Prostate 18 


305.5 


| 1 


J Pro9407c280A 


Prostate 19 


894 .5 




Pro9409C010R 


(prostatitis) 


O £ Q "7 





Pro9404C120R 


riuo La Lc ^ 1 

(prostatitis) 


O Q Q O 




I Prol000258 


Prost a t <=> 0 0 

*~ i- to LC Z- 

(BPH) 


1 H i7 . D 





I Pro4001263C 


(BPH) 


C\ 1 £ 




J Pro4001267A 


Prn^t^fp OA 
(BPH) 






Pro9411C032 


rx uo La Lc ^ j 

(BPH) 


lib . Z 


| 


Pro4001460Z 


Prostate 26 
(BPH) 


276.3 




1 Pro4U01271A 


Prostate 27 
(BPH) 


58.7 1 


1 


| Kidl064D/65D 


Kidney 1 


0 | 


0.1 j 


| Kidl079D/1080D 


Kidney 2 


0.3 | 


0.02 | 


| Kidl097D/1098D 


Kidney 3 


35.14 | 


0. 32 1 


| Kidl024D/1025D 


Kidney 4 


1 .31 


0 J 


| Kidll83D/1184D 


Kidney 5 


24 .79 


0 1 


| Kidl242D/1243D 


Kidney 6 


o 1 


0 i 


| Bld469K 


Bladder 1 




2.88 | 


| Bld467K/468K 


Bladder 2 


2.65 | 




| Bld327K/328K 


Bladder 3 


0 | 


4 .05 | 


| Bld470K 


Bladder 4 




1.64 | 


| Bld665T/664T 


Bladder 5 


0.21 ' 


1. 99 | 
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BXaX4ybr\/ ± 4 y / 1\ 


B 1 adder 6 


13.55 


1.14 
v 


n 1 ^TTOl v / I TOOK' 

Bldl / z ±J\/ l / zzi\ 


Bl adder 7 


120 . 16 


1.34 


1 S t Z -5 yA/ Z ft UA 


Testis 1 


31.5 1 


0.73 


Tef QQfl9n^47a /47R 


Testis 2 


15.7 


0 


1 S tbyoZUOOjH/ ODOD 


Testis 3 


72 


1.4 


Cb-nCQP91 94RZ\/94ftR 


Skin 1 


1.8 


0.5 


SknS99448A/44SB 


Skin 2 


251.6 


0 


Skn99816A/816B 


Skin 3 


33 


0.7 


Sto4004864A4/B4 


Stomach 1 


14 . 12 


0 


Sto4004509A3/Bl 


Stomach 2 


40.74 


39 


SmI9807A212A/213A 


Small 

Intestine 1 


0.1 


0 


SmI9802H008/H009 


Small 

Tntp^fi np 2 


5.8 


0.1 


ClnybuohSUlz/ duii 


Pol on 1 


4 . 5 


0 


Clny / uycu / fira/u / .3 r a 


Col on 2 


65 . 8 


3.1 


pi n/i nn/nnQai /7 HQR1 

L.-l_n4UU4 / UyHl/ /u^ijx 


Col on 3 


1 . 1 


0.9 




Colon 4 


34 .76 


0.73 


Liny / U /CUU4gD/ uuoya 


Colon 5 


90.26 


0.96 




Co] on 6 

> — V-/ _1_ w i 1 


17 . 9 


20. 64 


pi n Q/:i onnn^ / RODS 


Col on 7 


17 .56 


0 . 3 


Clny / Uor uuzu/ t uuiL 


Cnl on ft 


21.39 


0 




Col on 9 


429.14 


142.69 


Faniu .jfi ja 


r uii^i cao j. 


0 


0 


Pan / / Or/ / / / r 


Panrrpfls 2 

JL d 1 1 v X. ~ u Q 


0 


0.15 


Panyziu/ y zzu 




7 .36 


0 


Pan714L/715L 


Pancreas 4 


13.57 


0.11 


Pan824P/825P 


Pancreas 5 


0 


0 


Lng476Q/477Q 


Lung 1 


0 


0 


Lng605L/606L 


Lung 2 


0 


0.1 


Lnglll45B/11145C 


Lung 3 


85.9 


0 
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Lng0008632A/32B 



Lung 4 



23.85 



Lng750C/751C 



Lung 5 



0.32 



0.25 



Lng8890A/8890B 



Lung 6 



10. 63 



Lng8926A/8926B 



Lung 7 



15.37 



0 



Lng0010239A/39B 



Lung 8 



26. 17 



Lng9502C109R/110R 



Lung 9 



0. 68 



LngS9821944a/44b 



Lung 10 



J Mam00042D01/42N01 


Mammary Gland 1 


8.5 


1 0 


j Mam5 9XC 


Mammary Gland 2 


61 . 07 


1 0 


| Mam9706A066G/67C 


Mammary Gland 3 


4 .84 


0 


| Maml4153alC 


Mammary Gland 4 


9.72 


6.99 


1 Maml620F/1621F 


Mammary Gland 5 | 


0. 91 


0 


[Mam00014D05 ] 


Mammary Gland 6 


2.45 


0 


! Endl0479B/D | 


Endometrium 1 t 


133.43 1 


1.12 



15 End9705A125A/126A 



Endometrium 2 



0.39 



End9704C281A/282A 



Endometrium 3 



23.5 



1.56 



End680o97/681o97 



Endometrium 4 



88.89 



Utrl3590/13580 



Uterus 1 



0.2 



79.02 



Utr850U/851U 



Uterus 2 



20 Utrl4170/14180 



Uterus 3 



14 



0.4 



Utr233U96/234U96 



Uterus 4 



8 . 65 



CvxVNM00052D01/52N01 Cervix 1 



0.82 



4.64 



77.15 



CvxVNM00083D01/83N01 Cervix 2 



0 .78 



221. 48 



1 CvxND00023D01/23N01 


Cervix 3 


! 3.25 


j 15.22 


j Ovrl037O/1038O 


Ovary 1 


0.1 


0 


OvrlOOSO 


Ovary 2 


18.96 




Ovrl028 


Ovary 3 


0 




Ovrl4638AlC j 


Ovary 4 


3.2 




Ovrl4603AlD 


Ovary 5 


882.3 




Ovr7730 


Ovary 6 j 


0 | 





Ovary 7 




0.15 


Ovr206I 


Ovary 8 




v . — 

0 


Ovr9702C020GA 


Ovary 9 




0 


Ovr9702C025GA 


Ovary 10 




0 


Ovr9701C035GA 


Ovary 11 




0.07 


Ovr9701C050GB 


Ovary 12 




0.58 



0 = Negative 



In the analysis of matching samples, the higher levels 
of expression were in prostate, showing a high degree of 
tissue specificity for prostate tissue. In addition to the 
higher expression levels in prostate cancer samples, Pro113 
expression was found to be either induced (where not expressed 
in normal adjacent tissues) or somewhat upregulated in several 
other cancers. However, the relative expression and the fold 
increase in prostate cancer samples far exceed those in other 
cancer tissues and are highly significant. 

Furthermore, the levels of mRNA expression in cancer 
samples and the isogenic normal adjacent tissue from the same 
individual were compared. This comparison provides an 
indication of specificity for the cancer (e.g., higher levels 
of mRNA expression in the cancer sample compared to the normal 
adjacent tissue). Table 12 shows overexpression of Pro113 in 
13 out of 16 primary prostate cancer tissues compared with 
their respective normal adjacent tissue (prostate samples 2, 
3, 4, 5, 6, 7, 8, 9, 10, 11, 13, 14, 16). Thus, there was 
overexpression in the cancer tissue for 81.25% of the prostate 
matching samples tested. The median level of expression in 
prostate cancer tissue samples is 609, whereas the median for 
all other cancers is only 7.93, with the exception of one 
colon sample, colon 9, whose expression was similar to that 
found in prostate cancer tissues. 

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in 81.25% of the primary prostate 
matching samples tested, are indicative of Pro113 being a 
diagnostic marker for prostate cancer. Expression was also 
found to be higher in other cancer tissues compared with their 
respective normal adjacent tissues (kidney, bladder, testis, 
skin, stomach, small intestine, colon, pancreas, lung, 
mammary, endometrium, uterus, and ovary), thus indicating 
Pro113 to be a pan-cancer marker. 

Expression of Clone ID 1810463H1 (Pro114): 

For the CSG Pro114, real-time quantitative PCR was 
performed using the following primers: 
Forward Primer 
5'- TGGGCATCTGGGTGTCAA - 3' (SEQ ID NO: 33) 
Reverse Primer 
5'- CGGCTGCGATGAGGAAGTA - 3' (SEQ ID NO: 34) 

The absolute numbers depicted in Table 13 are relative 
levels of expression of the CSG Pro114 in 12 different normal 
tissues. All the values are compared to normal muscle 
(calibrator). These RNA samples are commercially available 
pools, originated by pooling samples of a particular tissue 
from different individuals. 
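
The passage above does not state the formula by which the relative levels are obtained from the raw PCR data. As a sketch only, the following Python example assumes the widely used comparative Ct (2^-ddCt) calculation with an endogenous reference gene and approximately 100% amplification efficiency. The function name `relative_expression` and all Ct values are hypothetical and are not taken from this document.

```python
def relative_expression(ct_target: float, ct_reference: float,
                        ct_target_cal: float, ct_reference_cal: float) -> float:
    """Relative level of a target gene versus a calibrator sample.

    Assumes the comparative Ct method: each sample is first normalized to an
    endogenous reference gene, then compared with the calibrator tissue.
    """
    delta_sample = ct_target - ct_reference              # dCt of the test sample
    delta_calibrator = ct_target_cal - ct_reference_cal  # dCt of the calibrator
    delta_delta = delta_sample - delta_calibrator        # ddCt
    return 2.0 ** -delta_delta


# Hypothetical Ct values: the calibrator compared with itself gives 1 by definition.
calibrator_level = relative_expression(30.0, 18.0, 30.0, 18.0)
# A sample whose target crosses threshold 3 cycles earlier gives an 8-fold level.
sample_level = relative_expression(27.0, 18.0, 30.0, 18.0)
print(calibrator_level, sample_level)
```

Under this assumption the calibrator tissue always has a relative level of 1, which is consistent with the value of 1 listed for the calibrator in the pooled-sample tables.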
Table 13: Relative Levels of CSG Pro114 Expression in 
Pooled Samples 

Tissue              NORMAL 
Brain               9.7 
Heart               0.7 
Kidney              414.4 
Liver               4 
Lung                882.2 
Mammary             44 
Muscle              1 
Prostate            1951 
Small Intestine     22 
Testis              367.1 
Thymus              25.8 
Uterus              139.6 



The relative levels of expression in Table 13 show that Pro114 
mRNA expression is higher (1951) in prostate compared with all 
the other normal tissues analyzed. Lung, with a relative 
expression level of 882.2, kidney (414.4), testis (367.1) and 
uterus (139.6) are the other tissues expressing higher levels 
of mRNA for Pro114. These results establish Pro114 mRNA 
expression to be more specific for prostate than for the other 
tissues examined. 

The high level of tissue specificity is indicative of 
Pro114 being a diagnostic marker for diseases of the prostate, 
especially cancer. 
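
The ranking described above can be reproduced from the Table 13 values with a short Python sketch. The variable names are illustrative assumptions. The ratio of the highest to the second-highest tissue is shown only as one possible way of expressing the margin and is not a measure used in this document.

```python
# Relative Pro114 levels in pooled normal tissues (Table 13, calibrator: muscle).
table_13 = {
    "Brain": 9.7, "Heart": 0.7, "Kidney": 414.4, "Liver": 4.0,
    "Lung": 882.2, "Mammary": 44.0, "Muscle": 1.0, "Prostate": 1951.0,
    "Small Intestine": 22.0, "Testis": 367.1, "Thymus": 25.8, "Uterus": 139.6,
}

# Sort tissues from highest to lowest relative expression.
ranked = sorted(table_13.items(), key=lambda item: item[1], reverse=True)
top_tissue, top_level = ranked[0]
runner_up, runner_up_level = ranked[1]

# How many times higher the top tissue is than the next one.
fold_over_runner_up = top_level / runner_up_level

for tissue, level in ranked[:5]:
    print(f"{tissue}: {level}")
print(f"{top_tissue} / {runner_up} = {fold_over_runner_up:.2f}")
```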

Expression of Clone ID zr65g11 (Pro118): 

For the CSG Pro118, real-time quantitative PCR was 
performed using the following primers: 
Forward Primer 
5'- GCCCATCTCCTGCTTCTTTAGT - 3' (SEQ ID NO: 35) 
Reverse Primer 
5'- CGTGGAGATGGCTCTGATGTA - 3' (SEQ ID NO: 36) 

The absolute numbers depicted in Table 14 are relative 
levels of expression of the CSG Pro118 in 12 different normal 
tissues. All the values are compared to normal kidney 
(calibrator). These RNA samples are commercially available 
pools, originated by pooling samples of a particular tissue 
from different individuals. 
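
As a simple consistency check on the primer sequences listed above, their length, GC fraction and reverse complement can be computed as sketched below. The helper names `gc_fraction` and `reverse_complement` are assumptions introduced for this illustration; no melting temperature or specificity analysis is attempted, and the check is not part of the disclosed procedure.

```python
# Translation table mapping each DNA base to its complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_fraction(primer: str) -> float:
    """Fraction of bases in the primer that are G or C."""
    return sum(base in "GC" for base in primer) / len(primer)


def reverse_complement(primer: str) -> str:
    """Complement every base, then reverse, giving the opposite strand 5' to 3'."""
    return primer.translate(COMPLEMENT)[::-1]


# Pro118 primers as given in the text (SEQ ID NO: 35 and SEQ ID NO: 36).
primers = {
    "forward": "GCCCATCTCCTGCTTCTTTAGT",
    "reverse": "CGTGGAGATGGCTCTGATGTA",
}

for name, seq in primers.items():
    print(name, len(seq), f"{gc_fraction(seq):.2f}", reverse_complement(seq))
```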

Table 14: Relative Levels of CSG Pro118 Expression in 
Pooled Samples 

Tissue              NORMAL 
Colon               0.87 
Endometrium         19282 
Kidney              1 
Liver               0 
Ovary               86.22 
Pancreas            0 
Prostate            962.1 
Small Intestine     0 
Spleen              0.75 
Stomach             0.54 
Testis              343.7 
Uterus              1064 

The relative levels of expression in Table 14 show that 
Pro118 mRNA expression is the third highest in prostate 
(962.1), after endometrium (19282) and uterus (1064), which 
are female-specific tissues. Testis, with a relative expression 
level of 343.7, is the only other male tissue expressing 
moderate levels of mRNA for Pro118. These results establish 
Pro118 mRNA expression to be highly specific for reproductive 
tissues, including the prostate. 

The absolute numbers in Table 14 were obtained by analyzing 
pools of samples of a particular tissue from different 
individuals. They cannot be compared to the absolute numbers 
originating from RNA obtained from tissue samples of a single 
individual in Table 15. 

The absolute numbers depicted in Table 15 are relative 
levels of expression of Pro118 in 59 pairs of matching and 21 
unmatched samples. All the values are compared to normal 
kidney (calibrator). A matching pair is formed by mRNA from 
the cancer sample for a particular tissue and mRNA from the 
normal adjacent sample for that same tissue from the same 
individual. 



Table 15: Relative Levels of CSG Pro118 Expression in 
Individual Samples 

Sample ID     Tissue                      Cancer      Matching Normal Adjacent 
Pro12B        Prostate 1                  41700.7     22242.83 
ProC234       Prostate 2                  40087 
Pro78XB       Prostate 3                  4075.6      7066.7 
Pro109XB      Prostate 4                  334.4       777.2 
Pro84XB       Prostate 5                  11684       58290 
Pro101XB      Prostate 6                  21474.13    100720.8 
Pro91X        Prostate 7                  14849       33717 
Pro13XB       Prostate 8                  202.57      146.91 



[illegible]   Prostate 9                  73243 
[illegible]   Prostate 10                 629.6       521.4 
[illegible]   Prostate 11                 157532.6    110654.4 
[illegible]   Prostate 12                 2317        64134 
[illegible]   Prostate 13                 42020 
[illegible]   Prostate 14                 2909.31 
[illegible]   Prostate 15                 29610       23264 
Pro110        Prostate 16                 13354       30991 
Pro65XB       Prostate 17                 10126       11270 
Pro69XB       Prostate 18                             2671.42 
Pro326        Prostate 19                 9962.3      19231 
Pro10R        Prostate 20 (prostatitis)   27355 
Pro20R        Prostate 21 (prostatitis)   21081 
[illegible]   Prostate 22 (BPH)           79916.32 
[illegible]   Prostate 23 (BPH)           108924.5 
[illegible]   Prostate 24 (BPH)           92910.22 
[illegible]   Prostate 25 (BPH)           57004.4 
[illegible]   Prostate 26 (BPH)           57449.23 
[illegible]   Prostate 27 (BPH)           45781.44 
[illegible]   Kidney 1                    3.08        217.36 
[illegible]   Kidney 2                    0           38.36 
[illegible]   Kidney 3                    0           123.5 
[illegible]   Kidney 4                    17.69       67.8 
[illegible]   Kidney 5                    16.74       360.8 
Kid124D       Kidney 6                    0           167.4 
Bld32XK       Bladder 1                   0           0 
Bld47K        Bladder 2                               36.38 
Bld66X        Bladder 3                   0           4.52 
BldTR14       Bladder 4                   0           12.17 
BldTR17       Bladder 5                   0           0 
Bld46XK       Bladder 6                   16.5        0 
Tst39X        Testis 1                    116.6       24.35 
Tst647T       Testis 2                    856.16      43.5 
StoAC44       Stomach 1                   0           0 
StoAC93       Stomach 2                   0           0 
SmI21XA       Small Intestine 1           68.45       0 
SmIH89        Small Intestine 2           0           0 
ClnAC19       Colon 1                     149         21.33 
ClnAS12       Colon 2                     0           0 
ClnB34        Colon 3                     0           0 
ClnB56        Colon 4                     13.04       5.22 
ClnAS43       Colon 5                     0           0 
Lng47XQ       Lung 1                      0           0 
Lng60XL       Lung 2                      0           0 
Lng75XC       Lung 3                      0           3.38 
Lng90X        Lung 4                      0           0 
LngBR26       Lung 5                      0           26.82 
Pan10343      Pancreas 1                  50.47       0 
Pan77X        Pancreas 2                  281.1       0 
Pan92X        Pancreas 3                  18.41       0 
Pan71XL       Pancreas 4                  0           0 
Pan82XP       Pancreas 5                  0           0 
PanC044       Pancreas 6                  0           0 
Mam12X        Mammary Gland 1             0           0 
Mam162X       Mammary Gland 2             0           0 
Mam42DN       Mammary Gland 3             0           0 
MamS127       Mammary Gland 4             12.58       0 
Mam14DN       Mammary Gland 5             0           0 
End28XA       Endometrium 1               331.9       1824 
End3AX        Endometrium 2               27825       65839 
End4XA        Endometrium 3               10.3        15935 
Utr1410       Uterus 1                    18885       18116 
Utr23XU       Uterus 2                    3358        7674 
CvxKS52       Cervix 1                    0           0 
CvxKS83       Cervix 2                    0           0 
Ovr1005O      Ovary 1                     72.86 
Ovr1028       Ovary 2                     0 
Ovr638A       Ovary 3                     0 
Ovr63A        Ovary 4                     90.88 
Ovr7730       Ovary 5                     1.21 
Ovr1040O      Ovary 6                     5.08 
OvrlOSO       Ovary 7                     0 
Ovr1118       Ovary 8                     7.41 
Ovr103X       Ovary 9                                 32.78 
Ovr20GA       Ovary 10                                0 
Ovr25GA       Ovary 11                                1173.83 
Ovr35GA       Ovary 12                                313.4 
Ovr50GB       Ovary 13                                823.1 
Ovr18GA       Ovary 14                                40.6 
Ovr206I       Ovary 15                                1264 
Ovr230A       Ovary 16                                1285 



0 = Negative 



In the analysis of matching samples, the higher levels of 
expression were in prostate, endometrium, testis, and ovary, 
showing a high degree of tissue specificity for reproductive 
tissues. These results confirmed the tissue specificity 
results obtained with the panel of normal pooled samples 
(Table 14). 

Furthermore, the levels of mRNA expression in cancer 
samples and the isogenic normal adjacent tissue from the same 
individual were compared. This comparison provides an 
indication of specificity for the cancer (e.g., higher levels 
of mRNA expression in the cancer sample compared to the normal 
adjacent tissue). Table 15 shows overexpression of Pro118 in 
5 out of 14 primary prostate cancer tissues (prostate samples 
1, 8, 10, 11, 15) compared with their respective normal 
adjacent tissue. Thus, there was overexpression in the cancer 
tissue for 35.71% of the prostate matching samples tested 
(total of 14 prostate matching samples). Expression of Pro118 
was similarly higher in 3 unmatched cancer tissues (prostate 
samples 9, 13, 14), 2 prostatitis (prostate samples 20, 21), 
and 6 benign hyperplasia tissues (prostate samples 22 through 
27). 
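
The tally stated above can be checked against the matched prostate pairs of Table 15 with the following Python sketch. The dictionary layout and variable names are assumptions made for illustration. Overexpression is taken here simply as a cancer value greater than the matching normal adjacent value, which is the comparison the paragraph describes.

```python
# Matched prostate pairs from Table 15: sample number -> (cancer, normal adjacent).
matched_prostate = {
    1: (41700.7, 22242.83),
    3: (4075.6, 7066.7),
    4: (334.4, 777.2),
    5: (11684, 58290),
    6: (21474.13, 100720.8),
    7: (14849, 33717),
    8: (202.57, 146.91),
    10: (629.6, 521.4),
    11: (157532.6, 110654.4),
    12: (2317, 64134),
    15: (29610, 23264),
    16: (13354, 30991),
    17: (10126, 11270),
    19: (9962.3, 19231),
}

# Samples in which the cancer tissue exceeds its own normal adjacent tissue.
overexpressed = sorted(
    sample for sample, (cancer, normal) in matched_prostate.items() if cancer > normal
)
percent = 100 * len(overexpressed) / len(matched_prostate)

print(overexpressed)
print(f"{len(overexpressed)} of {len(matched_prostate)} pairs = {percent:.2f}%")
```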

Altogether, the high level of tissue specificity, plus 
the mRNA overexpression in 35.71% of the primary prostate 
matching samples tested, are indicative of Pro118 being a 
diagnostic marker for prostate cancer. 

What is claimed is: 

1. A method for diagnosing the presence of prostate 
cancer in a patient comprising: 

(a) determining levels of CSG in cells, tissues or bodily 
fluids in a patient; and 

(b) comparing the determined levels of CSG with levels 
of CSG in cells, tissues or bodily fluids from a normal human 
control, wherein a change in determined levels of CSG in said 
patient versus normal human control is associated with the 
presence of prostate cancer. 

2. A method of diagnosing metastases of prostate cancer 
in a patient comprising: 

(a) identifying a patient having prostate cancer that is 
not known to have metastasized; 

(b) determining CSG levels in a sample of cells, tissues, 
or bodily fluid from said patient; and 

(c) comparing the determined CSG levels with levels of 
CSG in cells, tissue, or bodily fluid of a normal human 
control, wherein an increase in determined CSG levels in the 
patient versus the normal human control is associated with a 
cancer which has metastasized. 

3. A method of staging prostate cancer in a patient 
having prostate cancer comprising: 

(a) identifying a patient having prostate cancer; 

(b) determining CSG levels in a sample of cells, tissue, 
or bodily fluid from said patient; and 

(c) comparing determined CSG levels with levels of CSG 
in cells, tissues, or bodily fluid of a normal human control, 
wherein an increase in determined CSG levels in said patient 
versus the normal human control is associated with a cancer 
which is progressing and a decrease in the determined CSG 
levels is associated with a cancer which is regressing or in 
remission. 

4. A method of monitoring prostate cancer in a patient 
for the onset of metastasis comprising: 

(a) identifying a patient having prostate cancer that is 
not known to have metastasized; 

(b) periodically determining levels of CSG in samples of 
cells, tissues, or bodily fluid from said patient; and 

(c) comparing the periodically determined CSG levels with 
levels of CSG in cells, tissues, or bodily fluid of a normal 
human control, wherein an increase in any one of the 
periodically determined CSG levels in the patient versus the 
normal human control is associated with a cancer which has 
metastasized. 

5. A method of monitoring a change in stage of prostate 
cancer in a patient comprising: 

(a) identifying a patient having prostate cancer; 

(b) periodically determining levels of CSG in cells, 
tissues, or bodily fluid from said patient; and 

(c) comparing the periodically determined CSG levels with 
levels of CSG in cells, tissues, or bodily fluid of a normal 
human control, wherein an increase in any one of the 
periodically determined CSG levels in the patient versus the 
normal human control is associated with a cancer which is 
progressing in stage and a decrease is associated with a 
cancer which is regressing in stage or in remission. 

6. A method of identifying potential therapeutic agents 
for use in imaging and treating prostate cancer comprising 
screening molecules for an ability to bind to CSG wherein the 
ability of a molecule to bind to CSG is indicative of the 
molecule being useful in imaging and treating prostate cancer. 

7. The method of claim 1, 2, 3, 4, 5 or 6 wherein the 
CSG comprises SEQ ID NO:1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 
13, 14, 15, 16, 17, 18, 19 or 20 or a polypeptide encoded 
thereby. 

8. An antibody which specifically binds CSG. 

9. A method of imaging prostate cancer in a patient 
comprising administering to the patient an antibody of claim 
8. 

10. The method of claim 9 wherein said antibody is 
labeled with paramagnetic ions or a radioisotope. 

11. A method of treating prostate cancer in a patient 
comprising administering to the patient an antibody of claim 
7. 

12. The method of claim 11 wherein the antibody is 
conjugated to a cytotoxic agent. 

<400> 1 

ggtaaacacc tgcttttatc atcagaacaa agaggctgtg tcccctgccc tatgaggtcc 60 
atttctgaga gttgtggcta atgggcaaga aggttggggc tttagagatt tgggataaag 120 
atatcaaaca ccagaaaggt agaaagaagt gatcagatta gggttactta ggtgatgata 180 
tgaactct 188 

<210> 2 
<211> 9819 
<212> DNA 
<213> Homo sapiens 

<400> 2 
cagctggggt ctacccaggt ccatgtcttg gacatgttga gagtttttct ggaaggcagg 60 
gatacagtgt ggtccaaaaa cacacaaatg cccctactgg cccaggggtt gtcacaatag 120 
actggaaggg tgacacatcc caggcgcttg ccacccatca cacgcacctc ctacccactg 180 
gcatccttcc accccaggca cacacaaagc ctcagtccag agatcaactc tggactcagc 240 
tctgaatttg catatcctgt gtgtagattc attcttcata acctctgccc agcctagctt 300 
gtgtatcatt tttttttctc tattagggga ggagcccgtc ctggcactcc cattggcctg 360 
tagattcacc tcccctgggc agggccccag gacccaggat aatatctgtg cctcctgccc 420 
agaaccctcc aagcagacac aatggtaaga atggtgcctg tcctgctgtc tctgctgctg 480 
cttctgggtc ctgctgtccc ccaggagaac caagatggtg agtggggaaa gcaagggatg 540 
ggtgctggag aggactggaa ggaggtgagg aacaggacat gtggctggga gacaggctgg 600 
atgcagctgg gataccctgg catacggcag gaatgggtgc ccaaggctgt caactccctc 660 
agctcacaca cttccaggag cattcaggga gcctctgcgc tggcccgaaa taagaccttc 720 
aggaatctga atctaaaacc cctagtttac agtgaaaaca aagactccaa agaccaagcg 780 
acctgcttgg ggtagacagt caggacggag taggaaccat atgcctggag ctgcttctgc 840 
tcctgttcct tccctccttc cgatggctgg gtacacctgc ctgacgctga ggaaaagaga 900 
gagcagcccc aaggggaaag tgggaaggca ggttggctgg agggatggtg ctagaaggaa 960 
acccgtgccc aaatcccaca ctcagacacc actgcagtgg gtctggaagg cgagtggctg 1020 
gaagagaaga gagtgggagc tccgggagat caagagtcac tcctaggata agggaaggag 1080 
gctgtttgtg gcatgagaat gtgcaggata aagacatgga agcgaatggc ttctcagttg 1140 
tgtgagttta aaattcatga catttacaaa ttgtcagaaa aggtgttata tgtttgttat 1200 
ataacaatca ctttggaatg ttaatctgat tctgtgccaa aatctgaatt actcagggtt 1260 
ctccagagaa acagaactaa taggtggtac acatatacat atatatgtac gtacacatac 1320 
atacatacac tgtatacaca tggatacaca cacacatagg aagagattta catatatgta 1380 
tacaaaagag agagagagta gagatttatt ttaagaaatt gactcacact attgggagga 1440 
gtaacaagtc ctaaatcttc agagccggcc agcaggctgg agacccaggg aagagttgat 1500 
gtcttagtct tgattccaag ggcagactgt aggcagaatt ctttcctctt taggggacat 1560 
ctgaggcttt ttctcttaag gccttcaact gattggatga agcccaccac tatggagagt 1620 
aatccacttt actcaaggtc tactgatttt tttgtaaatt aaaaaaaaaa ctgtgggtgc 1680 
atagtatgtg tatatattta tggggtacat gagaggtttt gattcaggca tgcaatgtga 1740 
aataatcaca tcatcaaaaa tgaggtatcc atcccttcaa get tt tat eg tttgtgttac 1800 
agacaatcca attatacttt tttggttatt ttagttttta aaagtatttg attatttatt 1860 
tatttattta tttttgagac agagtctcac tctgtcaccc aggcaggagt gcagtggcat 1920 
gatctegget cactgcaacc tccgcctccc aggttcaagc aattttcctg cctcagtctc 1980 
ctgagtagct aggactacag gcacctgcca ccacacctgg ctaatttttt tgtattttta 2040 
gtagagaegg tttcatcatg ttggccaggc tagtcttgat atcctgacct cgtgatctgc 2100 
ccgccttggt ctcccaaagt geegggatta caggtgtcag caactgcgcc tggcctctct 2160 
tttggttatt taaaagtgta caattaaatt atgattatta ttattatttt tgagatggat 2220 
tcttgttctg tcacccaggc tggagtgcag tggcgtgatc ttggcttact gcaaacctcc 2280 
gcctgttggg ttcaagcaat tatcttgect cgggtgtaca ctgccacaca eggctaaett 2340 
atgtattttt aatagagata gggcttcacc atgttggcta gactggtctt gacctcttga 2400 
cctcaagtga tccactcact tcagcctccc agagtgctgg aattacaggc acgagccacc 2460 
acacctggcc ccagttaaat tattattgac tatagtcacc ctgttgtgct atcaaatagt 2520 
aggtcttatt cattcttctt tttttttttt tttttgtgac agagttgece aggctggaat 2580 
gcagtggtgc aatcttggct cactgcaacc tctgcctccc gggcttaagc gattctcctg 2640 
cctcagcctt ctgagtcget gggactacag gtgtgtgcca ccacgcccgg ctaatttatg 2 700 
tatttttagt agagatgggg tttcaccatg ttggccaggc tggtttcgaa ctcctgacct 2760 
caagtgaccc acctgcctca gcttcccaaa gtgttggaat tacaggcatg agccaccaca 2820 
cctggcccca gttaaattat tattcactgg agtcactttg ttgtgctatc aaatagtttt 2 880 
ctaactattt tttttgtacc cattaaccac cctcccaatt tccccccaac cctgccacta 2940 
cccttcccag cctttggtaa ccatccttct actctctatg tccatgaatt caattgtagg 3000 
gtctactgat ttaaaggcta atcacattta gacactcagg agcaagaata attttagtaa 3060 
ttgaactagg attctgecat atgacctcca acatcattag cacctgtgta aattgtatca 3120 
taaaataatt atggaactat tatggaaatg tccctctctc ccagatccca ccttgtacca 3180 
aaatgeaagg tacaaccccg ggaattctga gctccatcct agtcttaccc tgtgctaatt 3240 
cagtctgggt catttcttga attttctggt aaattctcct ttctaccctt tctaactata 3300 
tgtatttgtc aggttaagct agaagtgtta attttttttt tttttgagat ggagccttgc 3360 
tttgtcacct aggctgaagt gcagtggcat gatctcagct cactgcaagc tccgcctccc 3420 
gggttcatgc cattctcctg cctcagcctc ctgagtagct gggactacag gcacccgcca 3480 
ccatgcttgg ctaatttttt gaattcttag tagagacggg gtttcaccat gttagccagg 3540 
atggtctcga tctcctgacc tcgtgatcca cccgcctcgg ccccctaaag tgctgggatt 3600 
acaggcgtga gccactgagc ccggacgaaa tgttaatttg ttttttttga gacggagtct 3660 
cactctgtca tccaagctgg agtgcagtgg catgatcttg gcttgttgca a^ctctgcct 3720 
ctctggttca agtgattttc ctgcctcagc ctccagcatg actgggatta caggcccgca 3780 
ccaccatgcc cagctaattt ttgtattttt taatagagat ggggtttcac catgttggcc 3840 
aggctggtct tcaactcctg atctcaagta atctgcctgc cttggcctcc caaagtcctg 3900 
ggattacagg catgagccac ggagcccagc ctagaaatgt taatttctaa cgcatgtcag 3960 
attccatgca cactgggcaa ggttccattc ctccatgggg tgactcaggg atccaggcca 4020 
attgcatatt gagactcttt catattatcc tgtggccttc aaagtcgtca cctctaggga 4080 
tgagaaacaa aagggaaagc cagctggtag ggtcttggac aagaagaaag acatcacttc 4140 
tgctcacatt ctcttttgac aaaactcagt cacatggtcc caatatatct tcgaggtggc 4200 
tgagtaatgt tatcttccta tgtgtcaagc agaggaaata atgtagtgaa gacacaggat 4260 
ggtctctgaa atatcatctc aggcatgaaa gtagagcata ttcacttgag tgagcctcca 4320 
gtggtgtgaa gttgatggca ggagaaagag ctggggaaga aaaggccagt ggcaggtctc 4380 
ccctcctagc cctatgcagc cccacagtgg gacccttgca tggacctcaa ccatcagaat 4440 
cttttctttt gcaggtcgtt actctctgac ctatatctac actgggctgt ccaagcatgt 4500 
tgaagacgtc cccgcgtttc aggcccttgg ctcactcaat gacctccagt tctttagata 4560 
caacagtaaa gacaggaagt ctcagcccat gggactctgg agacaggtgg aaggaatgga 4620 
ggattggaag caggacagcc aacttcagaa ggccagggag gacatcttta tggagaccct 4680 
gaaagacatt gtggagtatt acaacgacag taacggtcag tgaataacag accacagggg 4740 
tggaaggtct aacccaagag gcagcccccc cagtgtgagt ggcaagggat cagcaggatg 4800 
gaaatagtcc caatcccagg ggaagaacag gagacacagc agaaacacag acatgtccgc 4860 
atcccaccca ccccacagca caggtgctcc ccgcttcccc atcaattgcc ccatcctcat 4920 
cccaggcctc aggtcacaca ggaagtgatg gcagagtcac ttcctatcca ggcacctatg 4980 
acctctcacc tccacacccc acccatcgga ggctgatacc cccgtgagaa ggcatcagac 5040 
tcacccctgt ccagggaggt tgcctggaga gtgagccact ctcaaagtca ctcagacctg 5100 
ggctcacctg gtggttctgc cagtcctagc tgttgacagt gaaacgttcc caaaatatct 5160 
ggttgaaatc tgcaaacatt ggagcactga gacctacctc caaacaagtc tgtaatattt 5220 
aactatgtct gttctatgaa ggatgtcaca gtctgtcctg atctcccttg cagctccatc 5280 
acctagcaca gggtacagcc aatattggct caattgaaat ttgtggaatc cacagagaaa 5340 
agcacccggc acacaccgta gcccatgctg ggggctcagg aagtgctgga ttcaaaactg 5400 
tgggctgtta gagttccttg gagccctaaa gttcctcctt accatacgat gcagacccag 5460 
gaagggccac ctgcgctatg gtcagaggag ctggtggcag agcccgtgca gagatggtcc 5520 
ctgtgccccc ggcccagtgc tctttctcct aaaccacact gccagcccca aggcagccaa 5580 
cctcaggtct ggtgaactgc tggtgttaaa ttatcataga gtgggtgtca aaagatgggc 5640 
tactaagtac aaaaatgccc aaggtgctac atgggatctg aagattttca aaaggaggca 5700 
agaaagagat aggcagatgt ttcaaggatg tggggtgggg gaggtcttgg taaggaaaat 5760 
ggcccaggct gtgtgtcagc aataggagag gagggggcac aggtgatcag aaaagacact 5820 
gggggaagca ttgatggaca ggaatagaaa tggcaaagtg gataattaag aggaaggagg 5880 
atgaggagat gaacacaggg tattagaaaa taatagaagg cagggcttgg tggctcactc 5940 
ttgtaatccc agcactttgg gaggctgagg caggcagatc acctaaggtc aggagttcga 6000 
gaccagcccg gccaacatgg tgaaaccctg tctctactaa taatacaaaa atagcctggc 6060 
atggtggcac acgtctgtgg tcccagctac tcaggaggct gaggcaggag aattgcttga 6120 
acccaggagg cagaggttac agtggccaaa atcctaccat tgcactacag cctgggtgac 6180 
aagagtgaaa cgttgtctaa aaacaaaaaa caaaaaacaa aaaaaggaaa taatagtagc 6240 
tgacatttac tgagcactta ctttgtgcca ggcccatcta tgagcatata taatgctcag 6300 
aatagccccc taaaacagtg ctcttggcat tgccatttca gaggtgagga aatagaggca 6360 
cagggagttg agtggctcca gttcaggcaa cacaccaggt gggggtgggg ggctggggag 6420 
agacctggga cgtgagccca gacagcttga gagctttcag agtctatgcc aacagcacca 6480 
accagtgctg ggtaaacacc tgcttttatc atcagaacaa agaggctgtg tcccctgccc 6540 
tatgaggtcc atttctgaga gttgtggcta atgggcaaga aggttggggc tttagagatt 6600 
tgggataaag atatcaaaca ccagaaaggt agaaagaagt gatcagatta gggttactta 6660 
ggtgatgata tgaactcttc ctagaactga gagaaaaaga gagccttcct ttactcatat 6720 
gaaatcacaa ataatttcta tccaatttgg aagtacactt tggtgtagtt gtgacagctt 6780 
cctcaggact cagcataaat tcaaacaaat aattgtcctt agaagagatg ctatagaaga 6840 
gatagaaata tattcatatt ctgtagcttt tttttttttg agatggagtt ttgctcttgt 6900 
cacccaagct ggagtgcagt gatgcaatct cagctcactg caaactttgc ctcctgggtt 6960 
caagggattc tcctgcctca gcctcccgat aactgggact acaggctaca ggcatgtgtc 7020 
actactcctg gttaattttt tttttttttt tttaagactg agtcttgctc tgtctttcag 7080 
gctgatgtac aatggctcca tctcggctca ctacaacttc tgtcccccag gttcaagcga 7140 
ttctcctgcc tcagcctcat gagtagctgg gattacaggc atgtgccagc acacccagca 7200 
aatttttgta tttttagtag agatgaggtc ttaccatgtt ggccaggctg gtctcaaact 7260 
cctgacctca ggtgatcctt tggcctcagc ctccctaact gctgggatta caggcatgag 7320 
ccactgcgtc cagcctaatt ttatattttt ggtagagatg gggtttcacc atattggcca 7380 
ggctggtctc gaactcatga cctaaggtga tccatcctcc tcagcctctc aaagtgctgg 7440 
gattacaagt gtgagccact gggcctggtg cttttttttt tttttttttt tttttttttt 7500 
tgagataggg tctcactctg tcacccaggc tgaaatgcag tagtgtgatt ttggctcatt 7560 
gcagccttga cttcccaggc tgaagtgatc ctcccacctc agcctcctga gtagctgggg 7620 
ctacaggcat gcaccaccat gctgcgctaa tttttatatt ttttgtagtg gtgggatttc 7680 
gccatatcac cctggctggt ctggaacccc tgggctcaag cgatccactc gcttcagctt 7740 
ctcaaagtgc tgggattaca ggcatgagcc acagcgccca ggctgtagct ctcttaagga 7800 
ggaacatatc tcatctgaga caaacctgaa atgccaaacc aaactgagtt agcccctctc 7860 
tgtctgttgt atatattgga gtaataacct atttgtcttg ataaagggat tgcatgcttg 7920 
aattgcaaaa acctttattt cttttgggtt gcccaatgtg caagactaag agttattttg 7980 
ataaatttct caccaggctg actgtctctc tgtggggtcg ggggagtttt cagggtctca 8040 
cgtattgcag ggaaggtttg gttgtgagat cgagaataac agaagcagcg gagcattctg 8100 
gaaatattac tatgatggaa aggactacat tgaattcaac aaagaaatcc cagcctgggt 8160 
ccccttcgac ccagcagccc agataaccaa gcagaagtgg gaggcagaac cagtctacgt 8220 
gcagcgggcc aaggcttacc tggaggagga gtgccctgcg actctgcgga aatacctgaa 8280 
atacagcaaa aatatcctgg accggcaagg tactcactgc ttcctgctcc ccagtactga 8340 
gcccagaata aaagacgatc tcaggctagg agctcaggca acatcttagt ccggtctcat 8400 
ctgttcctgg atgtccctca gacccccagc tttcatcttt taggatttat tccttccctg 8460 
ggataatata atttgtggtc caaaaagaac atcatcaaaa tttcaggcag aatgggccag 8520 
gaaggccatt ctttcttgat gagtgtcccc aaatcatctc caattaacag acaaggagct 8580 
tgaggttagg gaggtgaggg taacactgtc tgtaagaggc agagctggga ctcaaattcc 8640 
agatttcaga ttccaaatcc catcgttttt tatctctaca atgatgcctc ccatctgggt 8700 
ggtggagaga agggaggcgt gtaaaagtca gccccagaag gacaagagca agccagtgtg 8760 
agcggaattg atggctgcaa gctgagactt ggattggaga cgtagtgaga ctcaggattg 8820 
tgcagtgctg cagggaagtg gttgctggat agaggcatgg gctgaaccaa gcagctggac 8880 
tgagactggg ggacagaact ccaaagccca ctgagatgtg ggaaaacatg gagaagcaca 8940 
cggagcattc acaacttatt gccgtcagag tcaatacatg ggtgaggtgg ggattgggca 9000 
agagggaaag cgtcagcctt ccctgatatt ctggaaagtc tcccggggct gggggtgggc 9060 
aggtacagag cttcgagctc tgctgatcgc tgacatccag gggtgggggt aggaagagac 9120 
ctgggccggg agaagtccac ctcaagcctg cagtgtcaca ctctatccct ccacagatcc 9180 
tccctctgtg gtggtcacca gccaccaggc cccaggagaa aagaagaaac tgaagtgcct 9240 
ggcctacgac ttctacccag ggaaaattga tgtgcactgg actcgggccg gcgaggtgca 9300 
ggagcctgag ttacggggag atgttcttca caatggaaat ggcacttacc agtcctgggt 9360 
ggtggtggca gtgcccccgc aggacacagc cccctactcc tgccacgtgc agcacagcag 9420 
cctggcccag cccctcgtgg tgccctggga ggccagctag gaagcaaggg ttggaggcaa 9480 
tgtgggatct cagacccagt agctgccctt cctgcctgat gtgggagctg aaccacagaa 9540 
atcacagtca atggatccac aaggcctgag gagcagtgtg gggggacaga caggaggtgg 9600 
atttggagac cgaagactgg gatgcctgtc ttgagtagac ttggacccaa aaaatcatct 9660 
caccttgagc ccacccccac cccattgtct aatctgtaga agctaataaa taatcatccc 9720 
tccttgccta gcataacaga gaatcctttt tttaacggtg atgcgctgta gaaatgtgac 9780 
tagattttct cattggttct gccctcaagc actgaattc 9819 

<210> 3 
<211> 250 
<212> DNA 

<213> Homo sapiens 



cgcccctgcg ccgccgagcc agctgccaga atgccgaact ggggaggagg caagaaatgt 60 

ggggtgtgtc agaagacggt ttactttgcc gaagaggttc agtgcgaagg caacagcttc 120 

cataaatcct gcttcctgtg catggtctgc aagaagaatc tggacagtac cactgtggcc 180 

gtgcatggtg aggagattta ctgcaagtcc tgctacggca agaagtatgg gcccaaaggc 24 0 
tatggctacg ^ 250 

<210> 4 
<211> 1900 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (16) 

<220> 

<221> unsure 
<222> (18) 

<220> 

<221> unsure 
<222> (20) 

<220> 

<221> unsure 
<222> (1887) 

<220> 

<221> unsure 
<222> (1894) 



<400> 3 
<400> 4 

acgccttccg cggagnanan caaaacggcg cgcaggccgg gcgcacccag ccgccacttc 60 
cgagagcgcc tgccgcccct ggcgccgccg agccagctgc cagaatgccg aactggggag 120 
gaggcaagaa atgtggggtg tgtcaagaag acggtttact ttgccgaaga ggt?cagtgc 180 
gaaggcaaca gcttccataa atcctgcttc ctgtgcatgg tctgcaagaa gaatctggac 240 
agtaccactg tgggccgtgc atggtgagga gatttactgg caagtccctg ctacggcaag 300 
aagtatgggc ccaaaggcta tggctacggg ccagggcgca ggcaccctca gcactgacaa 360 
gggggagtcg ctgggtatca agcacgagga agcccctggg ccacaggccc accaccaacc 420 
ccaatggcat ccaaatttgc ccagaagatt ggtggctccg agcgctgccc ccgatgcagc 480 
caggcagtct atgctgcgga gaaggtgatt ggtgctggga agtcctggca taaggcctgc 540 
tttcgatgtg ccaagtgtgg caaaggcctt gagtcaacca ccctgggcag acaaggatgg 600 
cgagatttac tgcaaaggat gttatgctaa aaacttcggg cccaagggct ttggttttgg 660 
gcaaggagct ggggccttgg tccactctga gtgaggccac catcacccac cacaccctgc 720 
ccactcctgc gcttttcatc gccattccat tcccagcagc tttggagacc tccaggatta 780 
tttctctgtc agccctgcca catatcacta atgacttgaa cttgggcatc tggctccctt 840 
tggtttgggg gtctgcctga ggtcccaccc cactaaaggg ctccccaggc ctgggatctg 900 
acaccatcac cagtaggaga cctcagtgtt ttgggtctag gtgagagcag gcccctctcc 960 
ccacacctcg ccccacagag ctctgttctt agcctcctgt gctgcgtgtc catcatcagc 1020 
tgaccaagac acctgaggac acatcttggc acccagagga gcagcagcaa caggctggag 1080 
ggagagggaa gcaagaccaa gatgaggagg ggggaaggct gggttttttg gatctcagag 1140 
attctcctct gtgggaaaga ggttgagctt cctggtgtcc ctcagagtaa gcctgaggag 1200 
tcccagctta gggagttcac tattggaggc agagaggcat gcaggcaggg tcctaggagc 1260 
ccctgcttct ccaggcctct tgcctttgag tctttgtgga atggatagcc tcccactagg 1320 
actgggagga gaataaccca ggtcttaagg accccaaagt caggatgttg tttgatcttc 1380 
tcaaacatct agttccctgc ttgatgggag gatcctaatg aaatacctga aacatatatt 1440 
ggcatttatc aatggctcaa atcttcattt atctctggcc ttaaccctgg ctcctgaggc 1500 
tgcggccagc agagcccagg ccagggctct gttcttgcca cacctgcttg atcctcagat 1560 
gtggagggag gtaggcactg cctcagtctt catccaaaca cctttccctt tgccctgaga 1620 
cctcagaatc ttccctttaa cccaagaccc tgcctcttcc actccaccct tctccaggga 1680 
cccttagatc acatcactcc acccctgcca ggccccaggt taggaatagt ggtgggagga 1740 
aggggaaagg gctgggcctc accgctccca gcaactgaaa ggacaacact atctggagcc 1800 
acccactgaa agggctgcag gcatgggctg tacccaagct gatttctcat ctggtcaata 1860 
aagctgttta gaccagaaaa aaaaaanaaa aaanaaaagg 1900 



<210> 5 
<211> 273 
<212> DNA 

<213> Homo sapiens 



<400> 5 

gatgcatcaa aagagctgca agttctccac attgacttct tgaatcagga caacgccgtt 60 
tctcaccaca catgggagtt ccaaacgagc agtcctgtgt tccggcgagg acaggtgttt 120 
cacctgcggc tggtgctgaa ccagccccta caatcctacc accaactgaa actggaattc 180 
agcacagggc cgaatcctag catcgccaaa cacaccctgg tggtgctcga cccgaggacg 240 
ccctcagacc actacaactg gcaggcaacc ctt 273 



<210> 6 
<211> 3021 
<212> DNA 

<213> Homo sapiens 

<400> 6 
tgtggaagca ccaggcatca gagatagagt cttccctggc attgcaggag agaatctgaa 60 
gggatgatgg atgcatcaaa agagctgcaa gttctccaca ttgacttctt gaatcaggac 120 
aacgccgttt ctcaccacac atgggagttc caaacgagca gtcctgtgtt ccggcgagga 180 
caggtgtttc acctgcggct ggtgctgaac cagcccctac aatcctacca ccaactgaaa 240 
ctggaattca gcacagggcc gaatcctagc atcgccaaac acaccctggt ggtgctcgac 300 
ccgaggacgc cctcagacca ctacaactgg caggcaaccc ttcaaaatga gtctggcaaa 360 
gaggtcacag tggctgtcac cagttccccc aatgccatcc tgggcaagta ccaactaaac 420 
gtgaaaactg gaaaccacat ccttaagtct gaagaaaaca tcctatacct tctcttcaac 480 
ccatggtgta aagaggacat ggttttcatg cctgatgagg acgagcgcaa agagtacatc 540 
ctcaatgaca cgggctgcca ttacgtgggg gctgccagaa gtatcaaatg caaaccctgg 600 
aactttggtc agtttgagaa aaatgtcctg gactgctgca tttccctgct gactgagagc 660 
tccctcaagc ccacagatag gagggacccc gtgctggtgt gcagggccat gtgtgctatg 720 
atgagctttg agaaaggcca gggcgtgctc attgggaatt ggactgggga ctatgaaggt 780 
ggcacagccc catacaagtg gacaggcagt gccccgatcc tgcagcagta ctacaacacg 840 
aagcaggctg tgtgctttgg ccagtgctgg gtgtttgctg ggatcctgac tacagtgctg 900 
agagcgttgg gcatcccagc acgcagtgtg acaggcttcg attcagctca cgacacagaa 960 
aggaacctca cggtggacac ctatgtgaat gagaatggca agaaaatcac cagtatgacc 1020 
cacgactctg tctggaattt ccatgtgtgg acggatgcct ggatgaagcg accggatctg 1080 
cccaagggct acgacggctg gcaggctgtg gacgcaacgc cgcaggagcg aagccagggt 1140 
gtcttctgct gtgggccatc accactgacc gccatccgca aaggtgacat ctttattgtc 1200 
tatgacacca gattcgtctt ctcagaagtg aatggtgaca ggctcatctg gttggtgaag 1260 
atggtgaatg ggcaggagga gttacacgta atttcaatgg agaccacaag catcgggaaa 1320 
aacatcagca ccaaggcagt gggccaagac aggcggagag atatcaccta tgagtacaag 1380 
tatccagaag gctcctctga ggagaggcag gttcatggat catgccttcc tccttctcag 1440 
ttctgagagg gagcacagac gacctgtaaa agagaacttt cttcacatgt cggtacaatc 1500 
agatgatgtg ctgctgggaa actctgttaa tttcaccgtg attcttaaaa ggaagaccgc 1560 
tgccctacag aatgtcaaca tcttgggctc ctctgaacta cagttgtaca ctggcaagaa 1620 
gatggcaaaa ctgtgtgacc tcaataagac ctcgcagatc caaggtcaag tatcagaagt 1680 
gactctgacc ttggactcca agacctacat caacagcctg gctatattag atgatgagcc 1740 
agttatcaga ggtttcatca ttgcggaaat tgtggagtct aaggaaatca tggcctctga 1800 
agtattcacg tctttccagt accctgagtt ctctatagag ttgcctaaca caggcagaat 1860 
tggccagcta cttgtctgca attgtatctt caagaatacc ctggccatcc ccttgactga 1920 
cgtcaagttc tctttggaaa gcctgggcat ctcctcacta cagacctctg accatgggtg 1980 
agtctgcctg aggacggtgc agcctggtga gaccatccaa tcccaaataa aatgcacccc 2040 
aataaaaatg gacccaagaa atttatcgtc aagttaagtt ccaaacaagt gaaagagatt 2100 
aatgctcaga agattgttct catcaccaag tagccttgtc tgatgctgtg gagccttagt 2160 
tgagatttca gcatttccta ccttgtggct tagctttcag attatggatg attaaatttg 2220 
atgacttata tgagggcaga ttcaagagcc agcaggtcaa aaaggccaac acaaccataa 2280 
gcagccagac ccacaaggcc aggtcctgtg ctatcacagg gtcaccttct tttacagtta 2340 
gaaacaccag ccgaggccac agaatcccat ccctttcctg agtcatggcc tcaaaaatca 2400 
gggccaccat tgtctcaatt caaatccata gatttcgaag ccacagattc tctccccgga 2460 
gcaagcatga ctatgggcag cccagtgctg ccacctgctg acgacccttg agaagctgcc 2520 
atatcttcag gccatgggtt caccagccct gaaggcacct gtcaactgga gtgctctctc 2580 
agcactggga tgggcctgat agaagtgcat tctcctccta ttgcctccat tctcctctct 2640 
ctatccctga aatccaggaa gtccctctcc tggtgctcca agcagtttga agcccaatct 2700 
gcaaggacat ttctcaaggg ccatgtggtt ttgcagacaa ccctgtcctc aggcctgaac 2760 
tcaccataga gacccatgtc agcaaacggt gaccagcaaa tcctcttccc ttattctaaa 2820 
gctgcccctt gggagactcc agggagaagg cattgcttcc tccctggtgt gaactctttc 2880 
tttggtattc catccactat cctggcaact caaggctgct tctgttaact gaagcctgct 2940 
ccttcttgtt ctgccctcca gagatttgct caaatgatca ataagcttta aattaaactc 3000 
tacttcaaga aaaaaaaacc g 3021 



<210> 7 

<211> 267 

<212> DNA 

<213> Homo sapiens 



<400> 7 

gaacattcca gatacctatc attactcgat gctgttgata acagcaagat ggctttgaac 60 
tcagggtcac caccagctat tggaccttac tatgaaaacc atggatacca accggaaaac 120 
ccctatcccg cacagcccac tgtggtcccc actgtctacg aggtgcatcc ggctcagtac 180 
tacccgtccc ccgtgcccca gtacgccccg agggtcctga cgcaggcttc caaccccgtc 240 
gtctgcacgc agcccaaatc cccatcc 267 



<210> 8 

<211> 3443 

<212> DNA 

<213> Homo sapiens 



<400> 8 

gggcgggccg ggccgagtag gcgcgagcta agcaggaggc ggaggcggag gcggagggcg 60 
aggggcgggg agcgccgcct ggagcgcggc aggtcatatt gaacattcca gatacctatc 120 
attactcgat gctgttgata acagcaagat ggctttgaac tcagggtcac caccagctat 180 
tggaccttac tatgaaaacc atggatacca accggaaaac ccctatcccg cacagcccac 240 
tgtggtcccc actgtctacg aggtgcatcc ggctcagtac tacccgtccc ccgtgcccca 300 
gtacgccccg agggtcctga cgcaggcttc caaccccgtc gtctgcacgc agcccaaatc 360 
cccatccggg acagtgtgca cctcaaagac taagaaagca ctgtgcatca ccttgaccct 420 
ggggaccttc ctcgtgggag ctgcgctggc cgctggccta ctctggaagt tcatgggcag 480 
caagtgctcc aactctggga tagagtgcga ctcctcaggt acctgcatca acccctctaa 540 
ctggtgtgat ggcgtgtcac actgccccgg cggggaggac gagaatcggt gtgttcgcct 600 
ctacggacca aacttcatcc ttcaggtgta ctcatctcag aggaagtcct ggcaccctgt 660 
gtgccaagac gactggaacg agaactacgg gcgggcggcc tgcagggaca tgggctataa 720 
gaataatttt tactctagcc aaggaatagt ggatgacagc ggatccacca gctttatgaa 780 
actgaacaca agtgccggca atgtcgatat ctataaaaaa ctgtaccaca gtgatgcctg 840 
ttcttcaaaa gcagtggttt ctttacgctg tatagcctgc ggggtcaact tgaactcaag 900 
ccgccagagc aggatcgtgg gcggcgagag cgcgctcccg ggggcctggc cctgggcagg 960 
tcagcctgca cgtccagaac gtccacgtgt gcggaggctc catcatcacc cccgagtgga 1020 
tcgtgacagc cgcccactgc gtggaaaaac ctcttaacaa tccatggcat tggacggcat 1080 
ttgcggggat tttgagacaa tctttcatgt tctatggagc cggataccaa gtagaaaaag 1140 
tgatttctca tccaaattat gactccaaga ccaagaacaa tgacattgcg ctgatgaagc 1200 
tgcagaagcc tctgactttc aacgacctag tgaaaccagt gtgtctgccc aacccaggca 1260 
tgatgctgca gccagaacag ctctgctgga tttccgggtg gggggccacc gaggagaaag 1320 
ggaagacctc agaagtgctg aacgctgcca aggtgcttct cattgagaca cagagatgca 1380 
acagcagata tgtctatgac aacctgatca caccagccat gatctgtgcc ggcttcctgc 1440 
aggggaacgt cgattcttgc cagggtgaca gtggagggcc tctggtcact tcgaagaaca 1500 
atatctggtg gctgataggg gatacaagct ggggttctgg ctgtgccaaa gcttacagac 1560 
caggagtgta cgggaatgtg atggtattca cggactggat ttatcgacaa atgagggcag 1620 
acggctaatc cacatggtct tcgtccttga cgtcgtttta caagaaaaca atggggctgg 1680 
ttttgcttcc ccgtgcatga tttactctta gagatgattc agaggtcact tcatttttat 1740 
taaacagtga acttgtctgg ctttggcact ctctgccatt ctgtgcaggc tgcagtggct 1800 
cccctgccca gcctgctctc cctaacccct tgtccgcaag gggtgatggc cggctggttg 1860 
tgggcactgg cggtcaagtg tggaggagag gggtggaggc tgccccattg agatcttcct 1920 
gctgagtcct ttccaggggc caattttgga tgagcatgga gctgtcacct ctcagctgct 1980 
ggatgacttg agatgaaaaa ggagagacat ggaaagggag acagccaggt ggcacctgca 2040 
gcggctgcct ctggggccac ttggtagtgt ccccagccta cctctccaca aggggatttt 2100 
gctgatgggt tcttagagcc ttagcagccc tggatggtgg ccagaaataa agggaccagc 2160 
ccttcatggg tggtgacgtg gtagtcacct tgtaagggga acagaaacat ttttgttctt 2220 
atggggtgag aatatagaca gtgcccttgg gtgcgaggga agcaattgaa aaggaacttg 2280 
ccctgagcac tcctggtgca ggtctccacc tgcacattgg gtggggctcc tgggagggag 2340 
actcagcctt cctcctcatc ctccctgacc ctgctcctag caccctggag agtgcacatg 2400 
ccccttggtc ctgggcaggg gcgccaagtc tggcaccatg ttggcctctt caggcctgct 2460 
agtcactgga aattgaggtc catgggggaa atcaaggatg ctcagtttaa ggtacactgt 2520 
ttccatgtta tgtttctaca cattgctacc tcagtgctcc tggaaactta gcttttgatg 2580 
tctccaagta gtccaccttc atttaactct ttgaaactgt atcatctttg ccaagtaaga 2640 
gtggtggcct atttcagctg ctttgacaaa atgactggct cccgacttaa cgttctataa 2700 
atgaatgtgc tgaagcaaag tgcccatggt ggcggcgaag aagagaaaga tgtgttttgt 2760 
tttggactct ctgtggtccc ttccaatgct gtgggtttcc aaccagggga agggtccctt 2820 
ttgcattgcc aagtgccata accatgagca ctactctacc atggttctgc ctcctggcca 2880 
agcaggctgg tttgcaagaa tgaaatgaat gattctacag ctaggactta accttgaaat 2940 
ggaaagtctt gcaatcccat ttgcaggatc cgtctgtgca catgcctctg tagagagcag 3000 
cattcccagg gaccttggaa acagttggca ctgtaaggtg cttgctcccc aagacacatc 3060 
ctaaaaggtg ttgtaatggt gaaaacgtct tccttcttta ttgccccttc ttatttatgt 3120 
gaacaactgt ttgtcttttt ttgtatcttt tttaaactgt aaagttcaat tgtgaaaatg 3180 
aatatcatgc aaataaatta tgcgattttt ttttcaaagt aaccactgca tctttgaagt 3240 
tctgcctggt gagtaggacc agcctccatt tccttataag ggggtgatgt tgaggctgct 3300 
ggtcagagga ccaaaggtga ggcaaggcca gacttggtgc tcctgtggtt ggtgccctca 3360 
gttcctgcag cctgtcctgt tggagaggtc cctcaaatga ctccttctta ttattctatt 3420 
agtctgtttc catgggcgtg ata 3443 

<210> 9 
<211> 254 
<212> DNA 

<213> Homo sapiens 
<400> 9 

gtgctgcacc aggccaccat cctgcccaag actgggacag tgtccctgga ggtacggctc 60 
ctggaggcct cccgtgcctt cgaggtgtca gagaacggca acctggtagt gagtgggaag 120 
gtgcaccagt gggatgaccc tgaccccagg ctcttcgacc acccggaaag ccccaccccc 180 
aaccccacgg agcccctctt cctggcccag gctgaagttt acaaggagct gcgtctgcgt 240 
ggctacgact acgg 254 

<210> 10 

<211> 8470 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (4131) 

<220> 

<221> unsure 
<222> (5117) 

<220> 

<221> unsure 
<222> (5552) 

<400> 10 

cggccgtcga cacggcagcg gccccggcct ccctctccgc cgcgcttcag cctcccgctc 60 
cgccgcgctc cagcctcgct ctccgccgcc cgcaccgccg cccgcgccct caccagagca 120 
gccatggagg aggtggtgat tgccggcatg tccgggaagc tgccagagtc ggagaacttg 180 
caggagttct gggacaacct catcggcggt gtggacatgg tcacggacga tgaccgtcgc 240 
tggaaggcgg ggctctacgg cctgccccgg cggtccggca agctgaagga cctgtctagg 300 
tttgatgcct ccttcttcgg agtccacccc aagcaggcac acacgatgga ccctcagctg 360 
cggctgctgc tggaagtcac ctatgaagcc atcgtggacg gaggcatcaa cccagattca 420 
ctccgaggaa cacacactgg cgtctgggtg ggcgtgagcg gctctgagac ctcggaggcc 480 
ctgagccgag accccgagac actcgtgggc tacagcatgg tgggctgcca gcgagcgatg 540 
atggccaacc ggctctcctt cttcttcgac ttcagagggc ccagcatcgc actggacaca 600 
gcctgctcct ccagcctgat ggccctgcag aacgcctacc aggccatcca cagcgggcag 660 
tgccctgccg ccatcgtggg gggcatcaat gtcctgctga agcccaacac ctccgtgcag 720 
ttcttgaggc tggggatgct cagccccgag ggcacctgca aggccttcga cacagcgggg 780 
aatgggtact gccgctcgga gggtgtggtg gccgtcctgc tgaccaagaa gtccctggcc 840 
cggcgggtgt acgccaccat cctgaacgcc ggcaccaata cagatggctt caaggagcaa 900 
ggcgtgacct tcccctcagg ggatatccag gagcagctca tccgctcgtt gtaccagtcg 960 
gccggagtgg cccctgagtc atttgaatac atcgaagccc acggcacagg caccaaggtg 1020 
ggcgaccccc aggagctgaa tggcatcacc cgagccctgt gcgccacccg ccaggagccg 1080 
ctgctcatcg gctccaccaa gtccaacatg gggcacccgg agccagcctc ggggctggca 1140 
gccctggcca aggtgctgct gtccctggag cacgggctct gggcccccaa cctgcacttc 1200 
catagcccca accctgagat cccagcgctg ttggatgggc ggctgcaggt ggtggaccag 1260 
cccctgcccg tccgtggcgg caacgtgggc atcaactcct ttggcttcgg gggctccaaa 1320 
cgtgcacatc atcctgaggc ccaacacgca gccgcccccc gcacccggcc cacatgccac 1380 
cctgccccgt ctgctgcggg ccagcggacg cacccctgag gccgtgcaga agctgctgga 1440 
gcagggcctc cggcacagcc agggcctggc tttcctgagc atgtgaacga catcgcggct 1500 
gtccccgacc accgccatgc ccttccgtgg ctacgctgtg ctgggtggtg agacgcggtg 1560 
gcccagaggt gcagcaggtg cccgctggcg agcgcccgct ctggttcatc tgctctggga 1620 
tgggcacaca gtggcgcggg atggggctga gcctcatgcg cctggaccgc ttccgagatt 1680 
ccatcctacg ctccgatgag gctgtgaacc gattcggcct gaaggtgtca cagctgctgc 1740 
tgagcacaga cgagagcacc tttgatgaca tcgtccattc gtttgtgagc ctgactgcca 1800 
tccagatagg cctcatagac ctgctgagct gcatggggct gaggccagat ggcatcgtcg 1860 
gccactccct gggggaggtg gcctgtggct acgccgacgg ctgcctgtcc caggaggagg 1920 
ccgtcctcgc tgcctactgg aggggacagt gcatcaaaga agcccatctc ccg^cgggcg 1980 
ccatggcagc cgtgggcttg tcctgggagg agtgtaaaca gcgctgcccc ccggcggtgg 2040 
tgcccgccgc cacaactcca aggacacagt caccatctcg ggacctcagg ccccggtgtt 2100 
tgagttcgtg gagcagctga ggaaggaggg tgtgtttgcc aaggaggtgc ggaccggcgg 2160 
tatggccttc cactcctact tcatggaggc catcgcaccc ccactgctgc aggagctcaa 2220 
gaaggtgatc cgggagccga agccacgttc agcccgctgg ctcagcacct ctatccccga 2280 
ggcccagtgg cacagcagcc tggcacgcac gtcctccgcc gagtacaatg tcaacaacct 2340 
ggtgagccct gtgctgttcc aggaggccct gtggcacgtg cctgagcacg cggtggtgct 2400 
ggagatcgcg ccccacgccc tgctgcaggc tgtcctgaag cgtggcctga agccgagctg 2460 
caccatcatc cccctgatga agaaggatca cagggacaac ctggagttct tcctggccgg 2520 
catcggcagg ctgcacctct caggcatcga cgccaacccc aatgccttgt tcccacctgt 2580 
ggagtcccca gctccccgag gaactcccct catctcccca ctcatcaagt gggaccacag 2640 
cctggcctgg gacgcgccgg ccgccgagga cttccccaac ggttcaggtt ccccctcagc 2700 
caccatctac acatgcacac caagctccga gtctcctgac cgctacctgg tggaccacac 2760 
catcgacggt cgcgtcctct tccccgccac tggctacctg agcatagtgt ggaagacgct 2820 
ggcccgaccc ctgggcctgg gcgtcgagca gctgcctgtg gtgtttgagg atgtggtgct 2880 
gcaccaggcc accatcctgc ccaagactgg gacagtgtcc ctggaggtac ggctcctgga 2940 
ggcctcccgt gccttcgagg tgtcagagaa cggcaacctg gtagtgagtg ggaaggtgta 3000 
ccagtgggat gaccctgacc ccaggctctt cgaccacccg gaaagcccca cccccaaccc 3060 
cacggagccc ctcttcctgg cccaggctga agtttacaag gagctgcgtc tgcgtggcta 3120 
cgactacggc cctcatttcc agggcatcct ggaggccagc ctggaaggtg actcggggag 3180 
gctgctgtgg aaggataatg ggtgagttca tggacaccat gctgcagatg tccatcctgg 3240 
gtcggccaag cacggcctgt acctgcccac ccgtgtcacc gccatccaca tcgaccctgc 3300 
cacccacagg cagaagctgt acacactgca ggacaaggcc caagtggctg acgtggtggt 3360 
gagcaggtgg ctgagggtca cagtggccgg aggcgtccac atctccgggc tccacactga 3420 
gtcggccccg cggcggcagc aggagcagca ggtgcccatc ctggagaagt tttgcttcac 3480 
tccccacacg gaggaggggt gcctgtctga gcacgctgcc ctcgaggagg agctgcaact 3540 
gtgcaagggg ctggtcgagg cactcgagac caaggtgacc cagcaggggc tgaagatggt 3600 
ggtgcccgga ctggatgggg cccagatccc cccgggaccc ctcacagcag gaactgcccc 3660 
ggctgttgtc ggctgcctgc aggcttcagc tcaacgggaa cctgcagctg gagctggcgc 3720 
aggtgctggc ccaggagagg cccaagctgc cagaggaccc tctgctcagc ggcctcctgg 3780 
actccccggc actcaaggcc tgcctggaca ctgccgtgga gaacatgccc agcctgaaga 3840 
tgaaggtggt ggaggtgctg gccggccacg gtcacctgta ttcccgcatc ccaggcctgc 3900 
tcagccccca tcccctgctg cagctgagct acacggccac cgaccgccac ccccaggccc 3960 
tggaggctgc ccaggccgag ctgcagcagc acgacgttgc ccagggccag tgggatcccg 4020 
cagaccctgc ccccagcgcc ctgggcagcg cggacctcct ggtgtgcaac tgtgctgtgg 4080 
ctgccctcgg ggacccgcct cagctctcag caacatggtg gctgccctga nagaaggggg 4140 
ctttctgctc ctgcacacac tgctccgggg gcaccccctc ggggacatcg tggccttcct 4200 
cacctccact gagccgcagt atggccaggg catcctgagc caggacgcgt gggagagcct 4260 
cttctccagg gtgtcgctgc gcctggtggg cctgaagaag tccttctacg gctccacgct 4320 
cttcctgtgc cgccggccca ccccgcagga cagccccatc ttcctgccgg tggacgatac 4380 
cagcttccgc tgggtggagt ctctgaaggg catcctggct gacgaagact ctttcccggc 4440 
ctgtgtggct gaaggccatc aactgttcca cctcgggcgt ggtgggcttg gtgaactgtc 4500 
tccgccgaga gcccggcgga acgctccggt gtgtgctgct ctccaacctc agcagcacct 4560 
cccacgtccc ggaggtggac ccgggctccg cagaactgca gaaggtgttg cagggagacc 4620 
tggtgatgaa cgtctaccgc gacggggcct ggggggcttt ccgccacttc ctgctggagg 4680 
aggacaagcc tgaggagccg acggcacatg cctttgtgag caccctcacc cggggggacc 4740 
tgtccctcca tccgctgggt ctgctcctcg ctgcgccatg cccagcccac ctgccctggc 4800 
gcccagctct gcacggtcta ctacgcctcc ctcaacttcc gcgacatcat gctggccact 4860 
ggcaagctgt cccctgatgc catcccaggg aagtggacct cccaggacag cctgctaggt 4920 
atggagttct cgggccgaga cgccagcggc aagcgtgtga tgggactggt gcctgccaag 4980 
ggcctggcca cctctgtcct gctgtcaccg gacttcctct gggatgtgcc ttccaactgg 5040 
acgctggagg aggcggcctc ggtgcctgtc gtctacagca cggcctacta cgcgctggtg 5100 
gtgcgtgggc gggtgcnccc cggggagacg ctgctcatcc actcgggctc gggcggcgtg 5160 
ggccaggccg ccatcgccat cgccctcagt ctgggctgcc gcgtcttcac caccgtgggg 5220 
tcggctgaga agcgggcgta cctccaggcc aggttccccc agctcgacag caccagcttc 5280 
gccaactccc gggacacatc cttcgagcag catgtgctgt ggcacacggg cgggaagggc 5340 
gttgacctgg tcttgaactc cttggcggaa gagaagctgc aggccagcgt gaggtgcttg 5400 
gctacgcacg gtcgcttcct ggaaattggc aaattcgacc tttctcagaa ccacccgctc 5460 
ggcatggcta tcttcctgaa gaacgtgaca ttccacgggg tcctactgga tgcgttcttc 5520 
aacgagagca gtgctgactg gcgggaggtg tnggcgcttg tgcaggccgg catccgggat 5580 
ggggtggtac ggcccctcaa gtgcacggtg ttccatgggg cccaggtgga ggacgccttc 5640 
cgctacatgg cccaagggaa gcacattggc aaagtcgtcg tgcaggtgct tgcggaggag 5700 
ccggaggcag tggctgaagg gggccaaacc caagctgatg tcggccatct ccaagacctt 5760 
ctgcccggcc cacaagagct acatcatcgc tggtggtctg ggtggcttcg gcctggagtt 5820 
ggcgcagtgg ctgatacagc gtggggtgca gaagctcgtg ttgacttctc gctccgggat 5880 
ccggacaggc taccaggcca agcaggtccg ccggtggagg cgccagggcg tacaggtgca 5940 
ggtgtccacc agcaacatca gctcactgga gggggcccgg ggcctcattg ccgaggcggc 6000 
gcagcttgag gcccgtgggc ggcgtcttca acctggccgt ggtcttgaga gatggcttgc 6060 
tggagaacca gaccccagag ttcttccagg acgtctgcaa gcccaagtac agcggcaccc 6120 
tgaacctgga cagggtgacc cgagggcgtg ccctgagctg gactactttg tggtcttctc 6180 
ctctgtgagc tgcgggcgtg gcaatgcggg acagagcaac tacggctttg ccaatttccg 6240 
ccatggagcg tatctgtgag aaacgccggc acgaaggcct cccaggcctg gccgtgcagt 6300 
ggggcgccat cggcgacgtg ggcattttgg tggagacgat gagcaccaac gacacgatcg 6360 
tcagtggcac gctgccccag cgcatggcgt cctgcctgga ggtgctggac ctcttcctga 6420 
accagcccca catggtcctg agcagctttg tgctggctga gaaggctgcg gcctataggg 6480 
acagggacag ccagcgggac ctggtggagg ccgtggcaca catcctgggc atccgcgact 6540 
tggctgctgt caacctggac agctcactgg cggacctggg cctggactcg ctcatgagcg 6600 
tggaggtgcg ccagacgctg gagcgtgagc tcaacctggt gctgtccgtg cgcgaggtgc 6660 
ggcaactcac gctccggaaa ctgcaggagc tgtcctcaaa ggcggatgag gccagcgagc 6720 
tgggcatgcc ccacgcccaa ggaggatggt ctggcccagc agcagactca gctgaacctg 6780 
cgctccctgc tggtgaaccc ggagggcccc accctgatgc ggctcaactg ccgtgcagag 6840 
ctcggagcgg cccctgttcc tggtgcaccc aattcgaggg ctccaccacc gtgttccaca 6900 
gcctggcctc ccggctcagc atccccacct atggcctgca gtgcacccga gctgcgcccc 6960 
ttgacagcat ccacagcctg gctgcctact acatcgactg catcaggcag gtgcagcccg 7020 
agggccccta ccgcgtggcc ggctactcct acggggcctg cgtggccttt gaaatgtgct 7080 
cccagctgca ggcccagcag agcccagccc ccacccacaa cagcctcttc ctgttcgacg 7140 
gctcgcccac ctacgtactg gcctacaccc agagctaccg ggcaaagctg accccaggct 7200 
gtgaggctga ggctgagacg gaggccatat gcttcttcgt gcagcagttc acggacatgg 7260 
agcacaacag ggtgctggag gcgctgctgc cgctgaaggg cctagaggag cgtgtggcag 7320 
ccgccgtgga cctgatcatc aagagccacc agggcctgga ccgccaggag ctgagctttg 7380 
cggcccggtc cttctactac aagctgcgtg ccgctgagca gtacacaccc aaggccaagt 7440 
accatggcaa cgtgatgcta ctgcgcgcca agacgggtgg cgcctacggc gaggacctgg 7500 
gcgcggacta caacctctcc caggtatgcg acgggaaagt atccgtccac gtcatcgagg 7560 
gtgaccaccg cacgctgctg gagggcagcg gcctggagtc catcatcagc atcatccaca 7620 
gctccctggc tgagccacgc gtgagcgtgc gggagggcta ggcccgtgcc cccgcctgcc 7680 
accggaggtc actccaccat ccccacccca tcccacccca cccccgccat gcaacgggat 7740 
tgaagggtcc tgccggtggg accctgtccg gcccagtgcc actgcccccc gaggctagct 7800 
agacgtaggt gttaggcatg tcccacccac ccgccgcctc ccacggcacc tcggggacac 7860 
cagagctgcc gacttggaga ctcctggtct gtgaagagcc ggtggtgccc gtgcccgcag 7920 
gaactggggc tgggcctcgt gcgcccgtgg ggtctgcgct tggtctttct gtgcttggat 7980 
ttgcatattt attgcattgc tggtagagac ccccaggcct gtccaccctg ccaagactcc 8040 
tcaggcagcg tgtgggtccc gcactctgcc cccatttccc cgatgtcccc tgcgggcgcg 8100 
ggcagccacc caagcctgct ggctgcggcc ccctctcggc caggcattgg ctcagcccgc 8160 
tgagtggggg gtcgtgggcc agtccccgag gactgggccc ctgcacaggc acacagggcc 8220 
cggccacacc cagcggcccc ccgcacagcc acccgtgggg tgctgccctt atgcccggcg 8280 
ccgggcacca actccatgtt tggtgtttgt ctgtgtttgt ttttcaagaa atgattcaaa 8340 
ttgctgcttg gattttgaaa tttactgtaa ctgtcagtgt acacgtctgg accccgtttc 8400 
atttttacac caatttggta aaaatgctgc tctcagcctc ccacaattaa accgcatgtg 8460 
atctccaaaa 8470 

<210> 11 

<211> 812 

<212> DNA 

<213> Homo sapiens 

<400> 11 

gccgcagcca atcagcgcgc gtgcccgggc ccctgcgtct cttgcgtcaa gacggccgtg 60 
ctgagcgaat gcaggcgact tgcgagctgg gagcgattta aaacgctttg gattcccccg 120 
gcctgggtgg ggagagcgag ctgggtgccc cctagattcc ccgcccccgc acctcatgag 180 
ccgaccctcg gctccatgga gcccggcaat tatgccacct tggatggagc caaggatatc 240 
gaaggcttgc tgggagcggg aggggggcgg aatctggtcg cccactcccc tctgaccagc 300 
cacccagcgg cgcctacgct gatgcctgct gtcaactatg cccccttgga tctgccaggc 360 
tcggcggagc gccaaagcaa tgccacccat gccctggggt gccccagggg acgtccccag 420 
ctcccgtgcc ttatggttac tttggaggcg ggtactactc ctgccgagtg tcccggagct 480 
cgctgaaacc ctgtgcccag gcagccaccc tggccgcgta ccccgcggag actcccacgg 540 
ccggggaaga gtaccccagc cgccccactg agtttgcctt ctatccggga tatccgggaa 600 
cctaccagcc tatggccagt tacctggacg tgtctgtggt gcagactctg ggtgctcctg 660 
gagaaccgcg acatgactcc ctgttgcctg tggacagtta ccagtcttgg gctctcgctg 720 
gtggctggaa cagccagatg tgttgccagg gagaacagaa cccaccaggt cccttttgga 780 
aggcagcatt tgcagactcc agcgggcagc ac 812 

<210> 12 

<211> 2385 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ataagctggg gtaaagtatt ttcgcagttt ctgcctttag gattttatta gcttctctcc 60 
cccaggccgc agccaatcag cgcgcgtgcc cgggcccctg cgtctcttgc gtcaagacgg 120 
ccgtgctgag cgaatgcagg cgacttgcga gctgggagcg atttaaaacg ctttggattc 180 
ccccggcctg ggtggggaga gcgagctggg tgccccctag attccccgcc cccgcacctc 240 
atgagccgac cctcggctcc atggagcccg gcaattatgc caccttggat ggagccaagg 300 
atatcgaagg cttgctggga gcgggagggg ggcggaatct ggtcgcccac tcccctctga 360 
ccagccaccc agcggcgcct acgctgatgc ctgctgtcaa ctatgccccc ttggatctgc 420 
caggctcggc ggagccgcca aagcaatgcc acccatgccc tggggtgccc caggggacgt 480 
ccccagctcc cgtgccttat ggttactttg gaggcgggta ctactcctgc cgagtgtccc 540 
ggagctcgct gaaaccctgt gcccaggcag ccaccctggc cgcgtacccc gcggagactc 600 
ccacggccgg ggaagagtac cccagccgcc ccactgagtt tgccttctat ccgggatatc 660 
cgggaaccta ccagcctatg gccagttacc tggacgtgtc tgtggtgcag actctgggtg 720 
ctcctggaga accgcgacat gactccctgt tgcctgtgga cagttaccag tcttgggctc 780 
tcgctggtgg ctggaacagc cagatgtgtt gccagggaga acagaaccca ccaggtccct 840 
tttggaaggc agcatttgca gactccagcg ggcagcaccc tcctgacgcc tgcgcctttc 900 
gtcgcggccg caagaaacgc attccgtaca gcaaggggca gttgcgggag ctggagcggg 960 
agtatgcggc taacaagttc atcaccaagg acaagaggcg caagatctcg gcagccacca 1020 
gcctctcgga gcgccagatt accatctggt ttcagaaccg ccgggtcaaa gagaagaagg 1080 
ttctcgccaa ggtgaagaac agcgctaccc cttaagagat ctccttgcct gggtgggagg 1140 
agcgaaagtg ggggtgtcct ggggagacca ggaacctgcc aagcccaggc tggggccaag 1200 
gactctgctg agaggcccct agagacaaca cccttcccag gccactggct gctggactgt 1260 
tcctcaggag cggcctgggt acccagtatg tgcagggaga cggaacccca tgtgacagcc 1320 
cactccacca gggttcccaa agaacctggc ccagtcataa tcattcatcc tgacagtggc 1380 
aataatcacg ataaccagta ctagctgcca tgatcgttag cctcatattt tctatctaga 1440 
gctctgtaga gcactttaga aaccgctttc atgaattgag ctaattatga ataaatttgg 1500 
aaggcgatcc ctttgcaggg aagctttctc tcagaccccc ttccattaca cctctcaccc 1560 
tggtaacagc aggaagactg aggagagggg aacgggcaga ttcgttgtgt ggctgtgatg 1620 
tccgtttagc atttttctca gctgacagct gggtaggtgg acaattgtag aggctgtctc 1680 
ttcctccctc cttgtccacc ccatagggtg tacccactgg tcttggaagc acccatcctt 1740 
aatacgatga tttttctgtc gtgtgaaaat gaagccagca ggctgcccct agtcagtcct 1800 
tccttccaga gaaaaagaga tttgagaaag tgcctgggta attcaccatt aatttcctcc 1860 
cccaaactct ctgagtcttc ccttaatatt tctggtggtt ctgaccaaag caggtcatgg 1920 
tttgttgagc atttgggatc ccagtgaagt agatgtttgt agccttgcat acttagccct 1980 
tcccaggcac aaacggagtg gcagagtggt gccaaccctg ttttcccagt ccacgtagac 2040 
agattcacgt gcggaattct ggaagctgga gacagacggg ctctttgcag agccgggact 2100 
ctgagaggga catgagggcc tctgcctctg tgttcattct ctgatgtcct gtacctgggc 2160 
tcagtgcccg gtgggactca tctcctggcc gcgcagcaaa gccagcgggt tcgtgctggt 2220 
ccttcctgca ccttaggctg ggggtggggg gcctgccggc gcattctcca cgattgagcg 2280 
cacaggcctg aagtctggac aacccgcaga accgaagctc cgagcagcgg gtcggtggcg 2340 
agtagtgggg tcggtggcga gcagttggtg gtgggccgcg gccgc 2385 

<210> 13 

<211> 221 

<212> DNA 

<213> Homo sapiens 

<400> 13 

dsdnrstatc tttctgtgtg gtgcagccct gttggcagtg ggcatctggg tgtcaatcga 60 
tggggcatcc tttctgaaga tcttcgggcc actgtcgtcc agtgccatgc agtttgtcaa 120 
cgtgggctac ttcctcatcg cagccggcgt tgtggtcttt gctcttggtt tcctgggctg 180 
ctatggtgct aagactgaga gcaagtgtgc cctcgtgacg t 221 

<210> 14 

<211> 1533 

<212> DNA 

<213> Homo sapiens 



<400> 14 

gggcacgcag acattctggg aagccacttg ccccacccct gggctgcttc ttcttgagat 60 
caggaggggc gttgcccagg gctggtgttg ccaggtggag gcctgctgag gcagtggttg 120 
tggggatcgg tctccaggca gcagggggca gcagggtcaa ggagaggcta actggccacg 180 
ggtggggcca gcaggcgggc agaaggaggc tttaaagcgc ctaccctgcc tgcaggtgag 240 
cagtggtgtg tgagagccag gccgtccctc tgcctgccca ctcagtggca acacccggga 300 
gctgttttgt cctttgtgga gcctcagcag ttccctgctt tcagaactca ctgccaagag 360 
ccctgaacag gagccaccat ggcagtgctt cagcttcatt aagaccatga tgatcctctt 420 
caatttgctc atctttctgt gtggtgcagc cctgttggca gtgggcatct gggtgtcaat 480 
cgatggggca tcctttctga agatcttcgg gccactgtcg tccagtgcca tgcagtttgt 540 
caacgtgggc tacttcctca tcgcagccgg cgttgtggtc tttgctcttg gtttcctggg 600 
ctgctatggt gctaagactg agagcaagtg tgccctcgtg acgttcttct tcatcctcct 660 
cctcatcttc attgctgagg ttgcagctgc tgtggtcgcc ttggtgtaca ccacaatggc 720 
tgagcacttc ctgacgttgc tggtagtgcc tgccatcaag aaagattatg gttcccagga 780 
agacttcact caagtgtgga acaccaccat gaaagggctc aagtgctgtg gcttcaccaa 840 
ctatacggat tttgaggact caccctactt caaagagaac agtgcctttc ccccattctg 900 
ttgcaatgac aacgtcacca acacagccaa tgaaacctgc accaagcaaa aggctcacga 960 
ccaaaaagta gagggttgct tcaatcagct tttgtatgac atccgaacta atgcagtcac 1020 
cgtgggtggt gtggcagctg gaattggggg cctcgagctg gctgccatga ttgtgtccat 1080 
gtatctgtac tgcaatctac aataagtcca cttctgcctc tgccactact gctgccacat 1140 
gggaactgtg aagaggcacc ctggcaagca gcagtgattg ggggagggga caggatctaa 1200 
caatgtcact tgggccagaa tggacctgcc ctttctgctc cagacttggg gctagatagg 1260 
gaccactcct tttaggcgat gcctgacttt ccttccattg gtgggtggat gggtgggggg 1320 
cattccagag cctctaaggt agccagttct gttgcccatt cccccagtct attaaaccct 1380 
tgatatgccc cctaggccta gtggtgatcc cagtgctcta ctgggggatg agagaaaggc 1440 
attttatagc ctgggcataa gtgaaatcag cagagcctct gggtggatgt gtagaaggca 1500 
cttcaaaatg cataaacctg ttacaatgtt gcc 1533 

<210> 15 
<211> 472 
<212> DNA 

<213> Homo sapiens 
<400> 15 

tcagagaaaa ctcaaacttt attgagagaa ttttcaaatt ttcagtcaca ttttcaatgt 60 
gacatcagcc atgtgtgtag cttcagcttg tcttcttttt aacttatggc tgcccatctc 120 
ctgcttcttt agtcttagca tgcttaggat taggtggagt cttctctttt acatcagagc 180 
catctccacg ctcactccga gtcttttcca gatccatttc ctggcaatca ccttctactt 240 
tacgttcttc gatcggaggt gttccttctc tctcttgtcc aggttcaata tcctgattgt 300 
cagttggtgg ttcctcttgc tgagattcac cgggagccac gaatgcaacc acatcgggag 360 
cctcctgacc atctcctctt cctctggatc ttgatctcac tcgtgcactc atcgctgcaa 420 
ctagaagatc gtgaactgaa gaacttgagt cagcagagag cctggcgaag aa 472 



<210> 16 

<211> 478 
<212> DNA 

<213> Homo sapiens 
<400> 16 

cttcattctt cgccaggctc tctgctgact caagttcttc agttcacgat cttctagttg 60 
cagcgatgag tgcacgagtg agatcaagat ccagaggaag aggagatggt caggaggctc 120 
ccgatgtggt tgcattcgtg gctcccggtg aatctcagca agaggaacca ccaactgaca 180 
atcaggatat tgaacctgga caagagagag aaggaacacc tccgatcgaa gaacgtaaag 240 
tagaaggtga ttgccaggaa atggatctgg aaaagactcg gagtgagcgt ggagatggct 300 
ctgatgtaaa agagaagact ccacctaatc ctaagcatgc taagactaaa gaagcaggag 360 
atgggcagcc ataagttaaa aagaagacaa gctgaagcta cacacatggc tgatgtcaca 420 
ttgaaaatgt gactgaaaat ttgaaaattc tctcaataaa gtttgagttt tctctgaa 478 

<210> 17 
<211> 198 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> (191) 

<400> 17 

cccgctgtac caccccagca tgttctgcgc cggcggaggg caagaccaga aggactcctg 60 
caacggtgac tctggggggc ccctgatctg caacgggtac ttgcagggcc ttgtgtcttt 120 
cggaaaagcc ccgtgtggcc aagttggcgt gccaggtgtc tacaccaacc tctgcaaatt 180 
cactgagtgg nattaagg 198 

<210> 18 

<211> 465 

<212> DNA 

<213> Homo sapiens 

<400> 18 

fc 99 a 9 at 99 a gtatgtattt attttacaaa aataaatcac catcttcgga ccatttgtag 60 
actggaacat ttcgagcaat gagtgcgcca cacggacgag tgccctggtg actccctgat 120 
gttcgcgtca cccccagggc caccttggcg cccgcatgag cctcgcttcc cactcccggc 180 
ctccaactcc cttccctcgc agccgccatt caccttctgc tgtttatttg tctgcagagc 240 
gcctggacac cggaaaaggc gattccctga gcgcctggag ttggagacaa ttcctggttc 300 
agaatttaaa catctttcta aggtaagcgc tgctccaaaa ctcttcgccg cgtggggact 360 
ttgcaccagg ggcggttggg aaggaagttg gccctccacg ggttcctggg caaccgcggc 420 
ctgttgaaaa aaggttctgg gtcaaataat ttaacttcgg aggag 465 



<210> 19 
<211> 204 
<212> DNA 

<213> Homo sapiens 
<400> 19 

ggcgggaaca ggcggcgctg gacctgtacc cctacgacgc cgggacggac agcggcttca 60 
ccttctcctc ccccaacttc gccaccatcc cgcaggacac ggtgaccgag ataacgtcct 120 
cctctcccag ccacccggcc aactccttct actacccgcg gctgaaggcc ctgcctccca 180 
tcgccagggt gacactggtg cggc 204 

<210> 20 
<211> 294 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> (287) 

<400> 20 

gagatttctc ttcaatggct tcctgtgagc tagagtttga aaatatctta aaatcttgag 60 
ctagagatgg aagtagcttg gacgattttc attatcatgt aaatcgggtc actcaagggg 120 
ccaaccacag ctgggagcca ctgctcaggg gaaggttcat atgggacttt ctactgccca 180 
aggttctata caggatataa aggtgcctca cagtatagat ctggtagcaa agtaagaaga 240 
aacaaacact gatctctttc tgccacccct ctgacccttt ggaactnctc tgac 294 

<210> 21 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 21 

atcagaacaa agaggctgtg tc 22 

<210> 22 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 22 

atctctaaag ccccaacctt c 
<210> 23 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 23 

tgccgaagag gttcagtgc 

<210> 24 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 24 

gccacagtgg tactgtccag at 

<210> 25 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 25 

gctgcaagtt ctccacattg a 

<210> 26 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 26 

cagccgcagg tgaaacac 

<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence : Synthetic 
<400> 27 

tggctttgaa ctcagggtca 

<210> 28 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 28 

cggatgcacc tcgtagacag 

<210> 29 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 29 

cggcaacctg gtagtgagtg 

<210> 30 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 30 

cgcagctcct tgtaaacttc ag 

<210> 31 
<211> 20 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 31 

cgggaaccta ccagcctatg 20 



<210> 32 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 32 

caggcaacag ggagtcatgt 20 

<210> 33 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 33 

tgggcatctg ggtgtcaa 

<210> 34 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 34 

cggctgcgat gaggaagta 

<210> 35 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 35 

gcccatctcc tgcttcttta gt 

<210> 36 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Synthetic 
<400> 36 

cgtggagatg gctctgatgt a 21 